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(54) Title: A METHOD FOR EXTRACTING QUANTITATIVE INFORMATION RELATING TO AN INFLUENCE ON A CELLULAR 
RESPONSE 

(57) Abstract 

Cells are genetically modified to expresss a luminophorc, e.g., a modified (F64L, S65T, Y66H) Green Fluorescent Protein (GFP, 
EGFP) coupled to a component of an intracellular signalling pathway such as a transcription factor, a cGMP- or cAMP-dependent protein 
kinase, a cycUn- calmodulin- or phospholipid-depcndent or mitogen-actlvated serine/threonin protein Idnase, a tyrosine protein kinase, or 
a protein phosphatase (e.g. PKA, PKC, Erk, Smad, VASP, actin, p38, Jnkl, PKG. IkappaB, CDK2, Grk5, Zap70. p85, protein-tyrosinc 
phosphatase IC, Stat5, NFAT, NFkappaB, RhoA, PKB). An influence modulates the intracellular signalling pathway in such a way that 
the luminophore is being redistributed or translocated with the component in living cells in a manner experimentally determined to be 
conrclated to the degree of the influence. Measurement of redistribution is performed by recording of light intensity, fluorescence lifetime, 
polarization, wavelength shift, resonance energy transfer, or other properties by an apparatus consisting of e.g. a fluorescence microscope 
and a CCD camera. Data stored as digital images are processed to numbers representing the degree of redistribution. The method can be 
used as a screening program for identifying a compound that modulates a component and is capable of treating a disease related to the 
function of the component. 



I 



1 



4 



FOR THE PURPOSES OF INFORMATION ONLY 





Codes used to identify States party to the PCT on the front pages 


of pamphlets publishing international applications undei 


XL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


SZ 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Paso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


\ r. 




BY 


Belarus 


IS 


Iceland 


\n\ 








CA 


Canada 












. . t - 










IM. 


Poland 










Kk 


Hepiiblic of Korfa 


PT 


Portugal 








Cub.1 


KZ 


Kazaistan 


RO 


Romania 






cz 


C7pch Republic 


LC 


Saini Ljcia 


RL 


Russian Frdfrrvtirn 






DF, 


Gcma.iy 


LI 


Liechtenstein 










DK 


Denmark: 


! K 













wo 98/45704 



PCT/DK98/00145 



1 

A METHOD for extracting quantitative infornnation relating to an influence on a cellular re- 
sponse 



FIELD OF INVENTION 

5 The present invention relates to a method and tools for extracting quantitative infornnation 
relating to an influence, on a cellular response, in particular an Influence caused by contac- 
ting or incubating the cell with a substance influencing a cellular response, where the cellular 
response is manifested in redistribution of at least one component in the cell. In particular, 
the invention relates to a method for extracting quantitative information relating to an influen- 

10 ce on an intracellular pathway involving redistribution of at least one component associated 
with the pathway. The method of the invention may be used as a very efficient procedure for 
testing or discovering the influence of a substance on a physiological process, for example in 
connection with screening for new drugs, testing of substances for toxicity, identifying drug 
targets for known or novel drugi? Other valuable uses of the method and technology of the 

15 invention will be apparent to the skilled person on the basis of the following disclosure. In a 
particular embodiment of the invention, the present invention relates to a method of detec- 
ting intracellular translocation or redistribution of biologically active polypeptides, preferably 
an enzyme, affecting intracellular processes, and a DNA construct and a cell for use in the 
method, 

20 

BACKGROUND OF THE INVENTION 

Intracellular pathways are tightly regulated by a cascade of components that undergo 
modulation in a temporally and spatially characteristic manner. Several disease states can 
be attributed to altered activity of individual signalling components (i.e. protein kinases, pro- 
25 tein phosphatases, transcription factors). These components therefore render themselves as 
attractive targets for therapeutic intervention. 

Protein kinases and phosphatases are wet! describeri nnn^nop'^^*^ ^r. , - 

- n k nases n ceiiuidf 3igna,iing ana regulation have been subjected to extensive studies, 
detailed knowledge on e.g. the exact timing and spatial characteristics of signalling events is 
often difficult to obtain due *o innk of n r^^nvf-^ r.rt trr-^r^n'o^:'/ 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 



PCT/DK98/00145 



2 

Novel ways of monitoring specific modulation of intracellular pathways in intact, living cells is 
assumed to provide new opportunities in drug discovery, functional genomics, toxicology, 
patient monitoring etc. 

The spatial orchestration of protein kinase activity is likely to be essential for the high degree 
5 of specificity of individual protein kinases. The phosphorylation mediated by protein kinases 
is balanced by phosphatase activity. Also within the family of phosphatases translocation has 
been observed, e.g. translocation of PTP2C to membrane ruffles [(Cossette ef a/.1996)], and 
likewise is likely to be indicative of phosphatase activity. 

Protein kinases often show a specific intracellular distribution before, during and after activa- 
10 tion. Monitoring the translocation processes and/or redistribution of individual protein kinases 
or subunits thereof is thus likely to be indicative of their functional activity. A connection be- 
tween translocation and catalytic activation has been shown for protein kinases like the dia- 
cyl glycerol (DAG)-dependent protein kinase C (PKC), the cAMP-dependent protein kinase 
(PKA) [(DeBernardi ef aA1996)] and the mitogen-activated-protein kinase Erk-1 [(Sano et 
15 a/. 1995)]. 

Commonly used methods of detection of intraceiiuiar localisation/activity of protein kinases 
and phosphatases are immunoprecipitation, Western blotting and immunocytochemical de- 
tection. 

Taking the family of diacyl glycerol {DAG)-dependent protein kinase Cs (PKCs) as an exam- 
20 pie, it has been shown that individual PKC isoforms that are distributed among different tis- 
sues and cells have different activator requirements and undergo differentia! translocation in 
response to activation. Catalytically inactive DAG-dependent PKCs are generally distributed 
throughout the cytoplasm, whereas they upon activation translocate to become associated 
with different cellular components, e.g. piasma membrane [{Farese, 1992),(Fulop Jr. et 
25 a/.1995)] nucleus [(Khaiil et a/.1992)], cytoskeleton [{Blobe et a/.1996)]. The translocation 
phenomenon being indicative of PKC activation has been monitored using different ap- 
proaches: a) immunocytochemistry where the localisation of individual isoforms can be de- 
tected after permeabilisation and fixation of the cells f^Khalil ^onnM . . 

n A, r> . ; ...... 

.. I": u:o rjcroonorr- - ., ■ [instiae; 

■ '^-^Gn^j 3r.c :_: g^^Mei^ lagging of PKCn ([Schmidt et al. 1997]) and of PKCy and PKC 

r. ([Sakai et ai 1996]). The first method does not provide dynamic information whereas the 
latter methods will. Taqqinn PKC, with i^-es^^':-'^*;'. in:^^^';^^:': v-^. — : ' - ■ ^^r^ -..^ ■ 
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distinguish between different DAG-dependent isoforms of PKC but will label and show 
movement of all isoforms. Chemical and genetic labelling of specific DAG-dependent PKCs 
confirmed that they in an isoform specific manner upon activation move to cell periphery or 
nucleus. 

5 In an alternative method, protein kinase A activity has been measured in living cells by 

chemical labelling one of the kinase's subunit (Adams et a/. 1991). The basis of the method- 
ology is that the regulatory and catalytic subunit of purified protein kinase A is labelled with 
fluorescein and rhodamine, respectively. At low cAMF levels protein kinase A is assembled 
in a heterotetrameric form which enables fluorescence resonance energy transfer between 

10 the two fluorescent dyes. Activation of protein kinase A leads to dissociation of the complex, 
thereby eliminating the energy transfer. A disadvantage of this technology is that the labelled 
protein kinase A has to be microinjected into the cells of interest. This highly invasive tech- 
nique is cumbersome and not applicable to large scale screening of biologically active sub- 
stances. A further disadvantage of this technique as compared to the presented invention is 

15 that the labelled protein kinase A cannot be inserted into organisms/animals as a transgene. 

Recently it was discovered that Green Fluorescent Protein (GFP) expressed in many differ- 
ent cell types, including mammalian cells, became highly fluorescent [(Ghalfie et a/. 1994)]. 
WO95/07463 describes a cell capable of expressing GFP and a method for detecting a pro- 
tein of interest in a cell based on introducing into a cell a DNA molecule having DNA se- 

20 quence encoding the protein of interest linked to DNA sequence encoding a GFP such that 
the protein produced by the DNA molecule will have the protein of interest fused to the GFP, 
then culturing the cells in conditions permitting expression of the fused protein and detecting 
the location of the fluorescence in the cell, thereby localizing the protein of interest in the 
cell. However, examples of such fused proteins are not provided, and the use of fusion pro- 

25 teins with GFP for detection or quantitation of translocation or redistribution of biologically 
active polypeptides affecting intracellular processes upon activation, such as proteins in- 
volved in signalling pathways, e.g. protein kinases or phosphatases, has not been sug- 
gested. WO 95/07463 further describes cells useful for the detection of molecules, such 
hormones or heavy metals, in a biohoi^n' ^ — 

.,..,u-v/v K:it^men: wnirn ir ":r'- Vviii afieci tne expression of the 
1-^ tins way tne gene encoding GFP is used as a reporter gene in a cell which is con- 
structed for monitoring the presence of a specific molecular Identity 
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Green Fluorescent Protein has been used in an assay for the detection of translocation of 
the glucocorticoid receptor (GR) [Carey, KL et al., The Journal of Cell Biology, Vol. 133, No. 
5. p. 985-996 (1996)]. A GR-S65TGFP fusion has been used to study the mechanisms in- 
volved in translocation of the glucocorticoid receptor (GR) in response to the agonist dex- 
5 amethasone from the cytosol, where it is present in the absence of a ligand, through the nu- 
clear pore to the nucleus where it remains after ligand binding. The use of a GR-GFP fusion 
enables real-time imaging and quantitation of nuclear/cytoplasmic ratios of the fluorescence 
signal. 

Many currently used screening programmes designed to find compounds that affect protein 
1 0 kinase activity are based on measurements of kinase phosphorylation of artificial or natural 
substrates, receptor binding and/or reporter gene expression. 

DISCLOSURE OF THE INVENTION 

The present invention provides an important new dimension in the investigation of cellular 
1 5 systems involving redistribution in that the invention provides quantification of the redisthbu- 
tion responses or events caused by an influence, typically contact with a chemical substance 
or mixture of chemical substances, but also changes in the physical environment. The quan- 
tification makes it possible to set up meaningful relationships, expressed numerically, or as 
curves or graphs, between the influences (or the degree of influences) on cellular systems 
20 and the redistribution response. This is highly advantageous because, as has been found, 
the quantification can be achieved in both a fast and reproducible manner, and - what is per- 
haps even more important - the systems which become quantifiable utilizing the method of 
the invention are systems from which enormous amounts of new information and insight can 
be dehved. 

25 The present screening assays have the distinct advantage over other screening assays, 
e.g., receptor binding assays, enzymatic assays, and reporter gene assays, in providing a 
system in which biologically active substances with completely novel modes of action, e.a 

inhibition nr r^rr>r^n*-'^ .- f ^ , . , 

■ wriv TKi: i'^sures v^-- ^^'c^- sG;cctiViL^' iruj particular isotorrii or the Diologicalty ac- 
tive polypeptide and further development of compound selectivity versus other isoforms of 
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the same biologically active polypeptide or other components of the same signalling pa- 
thway. 

In its broadest aspect, the invention relates to a method for extracting quantitative informati- 
on relating to an influence on a cellular response, the method comprising recording variation, 
5 caused by the influence on a mechanically intact living cell or mechanically intact living cells, 
in spatially distributed light emitted from a luminophore, the luminophore being present in the 
cell or cells and being capable of being redistributed in a manner which is related with the 
degree of the influence, and/or of being modulated by a component which is capable of be- 
ing redistributed in a manner which is related to the degree of the influence, the association 

10 resulting in a modulation of the luminescence characteristics of the luminophore, detecting 
and recording the spatially distributed light from the luminophore, and processing the recor- 
ded variation in the spatially distributed light to provide quantitative Information correlating 
the spatial distribution or change in the spatial distribution to the degree of the influence. In a 
preferred embodiment of the invention the luminophore, which is present in the cell or cells, 

15 is capable of being redistributed by modulation of an intracellular pathway, in a manner 

which is related to the redistribution of at least one component of the intracellular pathway. In 
another preferred embodiment of the invention, the luminophore is a fluorophore. 

The cells 

20 In the invention the cell and/or cells are mechanically intact and alive throughout the experi- 
ment. In another embodiment of the invention, the cell or cells is/are fixed at a point in time 
after the application of the influence at which the response has been predetermined to be 
significant, and the recording is made at an arbitrary later time. 

The mechanically intact living ceil or cells could be selected from the group consisting of 
25 fungal cell or cells, such as a yeast cell or cells; invertebrate cell or cells including insect cell 
or cells; and vertebrate cell or cells, such as mammalian cell or cells. This cell or these cells 
is/are incubated at a temperature of 30°C or above, preferably at a temperature of from 32°C 
to 39°C, more preferably at a temoeratnrp — 

,::a, , ininc! : v^ail ^; a fnairix or laenticai cr 
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A cell used in the present invention should contain a nucleic acid construct encoding a fusion 
polypeptide as defined herein and be capable of expressing the sequence encoded by the 
construct. The ceil is a eukaryotic cell selected from the group consisting of fungal cells, 
such as yeast cells; invertebrate cells including insect cells; vertebrate cells such as mam- 
5 malian cells. The preferred cells are mammalian cells. 

In another aspect of the invention the cells could be from an organism carrying in at least 
one of its component cells a nucleic acid sequence encoding a fusion polypeptide as defined 
herein and be capable of expressing said nucleiu acid sequence. The organism is selected 
from the group consisting of unicellular and multicellular organisms, such as a mammal. 

10 

The luminophore 

The luminophore is the component which allows the redistribution to be visualised and/or 
recorded by emitting light in a spatial distribution related to the degree of influence. In one 
embodiment of the invention, the luminophore is capable of being redistributed in a manner 

1 5 which is physiologically relevant to the degree of the influence. In another embodiment, the 
luminophore is capable of associating with a component which is capable of being redistri- 
buted in a manner which is physiologically relevant to the degree of the influence. In another 
embodiment, the luminophore con-elation between the redistribution of the luminophore and 
the degree of the influence could be determined experimentally. In a preferred aspect of the 

20 invention, the luminophore is capable of being redistributed in substantially the same manner 
as the at least one component of an intracellular pathway. In yet another embodiment of the 
invention, the luminophore is capable of being quenched upon spatial association with a 
component which is redistributed by modulation of the pathway, the quenching being measu- 
red as a change in the intensity of the luminescence, 

25 The luminophore could be a fluorophore. In a preferred embodiment of the invention, the lu- 
minophore could be a polypeptide encoded by and expressed from a nucleotide sequence 
harboured in the cell or cells. The luminophore could be a hybrid polypeptide comprisinn n 
fusion of at least a portion of earh ^v - — ' ■ 



i ne luminescent polypeptide could be a GFP as defined herein or could be selected from the 
group consisting of green fluorescent proteins havina \^.r F^Ai -v,;t:]^ : ^ - : ,■ 
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such as F64L-GFP, F64L-Y66H-GFP, F64L-S65T-GFP, and EGFP. The GFP could be N- or 
C-terminally tagged, optionally via a peptide linker, to the biologically active polypeptide or a 
part or a subunit thereof. The fluorescent probe could be a component of a intracellular sig- 
nalling pathway. The probe is coded for by a nucleic acid construct. 

5 The pathway of investigation in the present invention could be an intracellular signalling pa- 
thway. 



I he influence 

In a preferred embodiment of the invention, the influence could be contact between the me- 
10 chanically intact living cell or the group of mechanically intact living cells with a chemical 
substance and/or incubation of the mechanically intact living cell or the group of mechani- 
cally intact living cells with a chemical substance. The influence will modulate the intracellular 
processes. In one aspect the modulation could be an activation of the intracellular proces- 
ses. In another aspect the modulation could be an deactivation of the intracellular processes. 
15 In yet another aspect, the influence could inhibit or promote the redistribution without directly 
affecting the metabolic activity of the component of the intracellular processes. 

In one embodiment the invention is used as a basis for a screening program, where the ef- 
fect of unknown influences such as a compound library, can be compared to Influence of 
known reference compounds under standardised conditions. 

20 

The recording 

In addition to the intensity, there are several parameters of fluorescence or luminescence 
which can be modulated by the effect of the influence on the underlying cellular phenomena, 
and can therefore be used in the invention. Some examples are resonance energy transfer, 
25 fluorescence lifetime, polahsation, wavelength shift. Each of these methods requires a parti- 
cular kind of Alter in the emission light path to select the component of the light desired and 
reject other comoonen^^ T^-p -o-,---<;-- 



err^noaime^^t "^vr.-'::;:::: , bpaiiaiiy aiStriDuted light emitted by a luminophore 

could be detected by a change in the resonance energy transfer between the luminophore 
and another luminescent entity capable of delivenno pv^^ov *c t^o ': :n':^nr n'-^'^ n -r- - - 
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which has been selected or engineered to become part of, bound to or associated with parti- 
cular components of the intracellular pathway. In this embodiment, either the luminophore or 
the luminescent entity capable of delivering energy to the luminophore undergoes redistribu- 
tion in response to an influence. The resonance energy transfer would be measured as a 
5 change in the intensity of emission from the luminophore, preferably sensed by a single 
channel photodetector which responds only to the average intensity of the luminophore in a 
non-spatially resolved fashion. 

In one embodiment of the invention, the recording of the spatially disthbuted light could be 
made at a single point in time after the application of the influence. In another embodiment, 

10 the recording could be made at two points in time, one point being before, and the other 
point being after the application of the influence. The result or variation is determined from 
the change in fluorescence compared to the fluorescence measured prior to the influence or 
modulation. In another embodiment of the invention, the recording could be performed at a 
series of points in time, in which the application of the influence occurs at some time after 

15 the first time point in the series of recordings, the recording being performed, e.g., with a 
predetermined time spacing of from 0.1 seconds to 1 hour, preferably from 1 to 60 seconds, 
more preferably from 1 to 30 seconds, in particular from 1 to 10 seconds, over a time span 
of from 1 second to 12 hours, such as from 10 seconds to 12 hours, e.g., from 10 seconds to 
one hour, such as from 60 seconds to 30 minutes or 20 minutes. The result or variation is 

20 determined from the change in fluorescence over time. The result or variation could also be 
determined as a change in the spatial distribution of the fluorescence over time. 



Apparatus 

The recording of spatially distributed luminescence emitted from the luminophore is perfor- 
25 med by an apparatus for measuring the distribution of fluorescence in the cell or cells, and 
thereby any change in the distribution of fluorescence in the ceil or cells, which includes at a 
minimum the following component parts: (a) a light source, (b) a method for selecting the 
wavelength(s) of light from the source wHfr>h nv-'^-. - ^' 

■:ii)iuea tiuorescencG in a spatially resolved fashion, and forming an image from this fluore- 
scence emission, (e) a bench or stand which holds the container of the cells being measured 
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record the spatially resolved fluorescence in the form of an innage, (g) a connputer or electro- 
nic system and associated software to acquire and store the recorded images, and to com- 
pute the degree of redistribution from the recorded images. 

In a preferred embodiment of the invention the apparatus system is automated. In one em- 
5 bodiment the components in d and e mentioned above comprise a fluorescence microscope. 
In one embodiment the component in f mentioned above is a CCD camera. 

In one embodiment the image is fomned and recorded by an optical scanning system. 

In one embodiment a liquid addition system is used to add a known or unknown compound 
to any or all of the cells in the cell holder at a time determined in advance. Preferably, the 
10 liquid addition system is under the control of the computer or electronic system. Such an 

automated system can be used for a screening program due to its ability to generate results 
from a larger number of test compounds than a human operator could generate using the 
apparatus in a manual fashion. 



15 Quantitation of the influence 

The recording of the variation or result with respect to light emitted from the luminophore is 
performed by recording the spatially distributed light as one or more digital images, and the 
processing of the recorded variation to reduce it to one or more numbers representative of 
the degree of redistribution comprises a digital image processing procedure or combination 

20 of digital image processing procedures. The quantitative information which is indicative of the 
degree of the cellular response to the influence or the result of the influence on the intracel- 
lular pathway is extracted from the recording or recordings according to a predetermined ca- 
libration based on responses or results, recorded in the same manner, to known degrees of 
a relevant specific influence. This calibration procedure is developed according to principles 

25 described below (Developing an Image-based Assay Technique). Specific descriptions of the 
procedures for particular assays are given in the examples. 

While the stepwise procedure necessary to reduce the \m^rn 

sentative of ^h-^ - 

: ... .icuLji ste;3S arp po;:"' cpcidiionb 
. o..:3tract;c:'-' ^'lrln:ra -r::: inresr.oiaing, digital filtering methods such as smoothing, 
sharpening, and edge detection, spatial frequency methods such as Fourier filtering, imaae 
cross-correiation and image autocorrelation ^hir-^* ^ r r : . - * - - : i , - 
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and colour space manipulations for visualisation. In addition to the algorithmic procedures, 
heuristic methods such as neural networks may also be used. 



Nucleic acid constructs 

5 The nucleic acid constructs used in the present invention encode in their nucleic acid sequ- 
ences fusion polypeptides comprising a biologically active polypeptide that is a component of 
an intracellular signalling pathway, or a part thereof, and a GFP, preferabiy an F64L mutant 
of GFP, N- or C-terminally fused, optionally via a peptide linker, to the biologically active po- 
lypeptide or part thereof, 

10 In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a protein kinase or a phosphatase. 

in one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a transcription factor or a part thereof which changes cellular localisation upon activation. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
15 is a protein, or a part thereof, which is associated with the cytoskeletal network and which 
changes cellular localisation upon activation. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a protein kinase or a part thereof which changes cellular localisation upon activation. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
20 is a serine/threonine protein kinase or a part thereof capable of changing intracellular locali- 
sation upon activation. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a tyrosine protein kinase or a part thereof capable of changing intracellular localisation 
upon activation. 

25 In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a phospholipid-dependent serine/threonine protein kinn^p n r>^^ +-r. • 
^ H p ' 

. . ............. ...^,vpept:ae en^^nnec: r:^y v-.e Muc.eic ac:C consirjci 

t rAr/.P-depenaent protein kinase or a part thereof capable of changing cellular localisati- 
30 on upon activation. In a preferred embodiment the biologically active DolvpeDtide enccdod ^ . 
the nucleic acid const-'uct n PKA- rr^.iL ^S^'-T /"^-^ ' 
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In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a cGMP-dependent protein kinase or a part thereof capable of changing cellular localisati- 
on upon activation. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
5 is a calmodulin-dependent serine/threonine protein kinase or a part thereof capable of chan- 
ging cellular localisation upon activation. 

In one embodiment the biologically active polypeptide encoded by the nuclein acid construct 
is a mitoqen-activated serine/threonine protein kinase or a part thereof capable of changing 
cellular localisation upon activation. In preferred embodiments the biologically active poly- 
10 peptide encoded by the nucleic acid constructs are an ERK1-F64L-S65T-GFP fusion or an 
EGFP-ERK1 fusion. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a cyclin-dependent serine/threonine protein kinase or a part thereof capable of changing 
cellular localisation upon activation. 

15 In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
is a protein phosphatase or a part thereof capable of changing cellular localisation upon acti- 
vation. 

In one preferred embodiment of the invention the nucleic acid constructs may be DNA con- 
structs. 

20 In one embodiment the biologically active polypeptide encoded by the nucleic acid construct 
In one embodiment the gene encoding GFP in the nucleic acid construct is derived from Ae- 
quorea victoria. In a preferred embodiment the gene encoding GFP in the nucleic acid con- 
struct is EGFP or a GFP variant selected from F64L-GFP. F64L-Y66H-GFP and F64L-S65T- 
GFP. 

25 In preferred embodiments of the invention the DNA constructs which can be identified by any 
of the DNA sequences shown in SEQ ID NO: 38, 40. 42, 44. 46, 48, 50, 52, 54. 56. 58. 60, 
62,64,66,68.70,72,74,76.78,108,110,112,114 i^f^ ' ' 

... /. ■.• :.;.o;oaicai:V ^-r; '''-"e:'. . 

. ./ :;:;:rn^r-v - r .] 3::ii^t^ varia::: or a nonnoiogue from another species. 
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Screening program 

The present invention describes a method that may be used to establish a screening pro- 
gram for the identification of biologically active substances that directly or indirectly affects 
intracellular signalling pathways and because of this property are potentially useful as me- 
5 dicaments. Based on measurements in living cells of the redistribution of spatially resolved 
luminescence from luminophores which undergo a change in distribution upon activation or 
deactivation of an intracellular signalling pathway the result of the individual measurement of 
each substance being screened indicates its potential biological activity. 

In one embodiment of the invention the screening program is used for the identification of a 
10 biologically toxic substance as defined herein that exerts its toxic effect by interfering with an 
intracellular signalling pathway. Based on measurements in living cells of the redistribution of 
spatially resolved luminescence from luminophores which undergo a change in distribution 
upon activation or deactivation of an intracellular signalling pathway the result of the individ- 
ual measurement of each substance being screened indicates its potential biologically toxic 
15 activity. In one embodiment of a screening program a compound that modulates a compo- 
nent of an intracellular pathway as defined herein, can be found and the therapeutic amount 
of the compound estimated by a method according to the method of the invention. In a pre- 
ferred embodiment the present invention leads to the discovery of a new way of treating a 
condition or disease related to the intracellular function of a biologically active polypeptide 
20 comprising administration to a patient suffering from said condition or disease of an effective 
amount of a compound which has been discovered by any method according to the inven- 
tion. In another preferred embodiment of the invention a method is established for identifica- 
tion of a new drug target or several new drug targets among the group of biologically active 
polypeptides which are components of intracellular signalling pathways. 

25 In another embodiment of the invention an Individual treatment regimen is established for the 
selective treatment of a selected patient suffering from an ailment where the available me- 
dicaments used for treatment of the ailment are tested on a relevant primary cell or cells 
obtained from said patient from one or several tissues, using a method comnricin- 
fectinq the rp!' ^^n^ - 

^.i^-. . ij.j caiieru. 

■ . ■ v:ciis unon'' ^cnc:tic"5 f^errr.itting tne expression of said probes and ex- 

posing it to an array of the available medicaments, then comparing changes in fluorescence 
patterns or redistribution patterns of the fluorescent r^'-obe^ in r-^.trr:: I:;r-: ■ - >■ ■ ' 
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detect the cellular response to the specific medicaments (obtaining a cellular action profile), 
then selecting one or more medicament or medicaments based on the desired activity and 
acceptable level of side effects and administering an effective amount of these medicaments 
to the selected patient. 

5 

Back-tracking of a signal transduction pathway 

The present invention describes a method that may be used to establish a screening pro- 
gram Tor back-tracking signal transduction pathways as defined herein. In one embodiment 
the screening program is used to establish more precisely at which level one or several 
10 compounds affect a specific signal transduction pathway by successively or in parallel test- 
ing the influence of the compound or compounds on the redistribution of spatially resolved 
luminescence from several of the luminophores which undergo a change in distribution upon 
activation or deactivation of the intracellular signalling pathway under study. 



15 Construction and testing of probes 

In general, a probe, i.e. a "GeneX^-GFR fusion or a GFP-"GeneX" fusion, is constructed 
using PGR with "GeneX"-specific primers followed by a cloning step to fuse "GeneX" in 
frame with GFP. The fusion may contain a short vector derived sequence between "GeneX" 
and GFP (e.g. part of a multiple cloning site region in the plasmid) resulting in a peptide lin- 
20 ker between "GeneX" and GFP in the resulting fusion protein. 



Detailed stepwise procedure: 

- Identifying the sequence of the gene. This is most readily done by searching a depository 
of genetic information, e.g. the GenBank Sequence Database, which is widely available and 
25 routinely used by molecular biologists. In the specific examples below the GenBank Acces- 
sion number of the gene in question is provided. 



^one-specitic primers to oe used in a PGR reaction. Typically, the top-strand primer en- 
30 compasses the ATG start codon of the gene and the following ca. 20 nucleotides, while the 

bottom-strand vr^nr en :^r,r^ :-]':-. ^~^r - t-^- ''-n [ ■ ^ ^ n -t'-^-^ 
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the gene is to be fused behind GFP, i.e. a GFP-"GeneX" fusion. If the gene is to be fused in 
front of GFP, i.e. a "GeneX"-GFP fusion, a stop codon must be avoided. Optionally, the full 
length sequence of GeneX may not be used in the fusion, but merely the part which localizes 
and redistributes like GeneX in response to a signal. 

5 

In addition to gene-specific sequences, the primers contain at least one recognition sequen- 
ce for a restriction enzyme, to allow subsequent cloning of the PGR product. The sites are 
chosen so that they arc unique in the PGR product and compatible with sites in the cloning 
vector. Furthermore, it may be necessary to include an exact number of nucleotides between 
10 the restriction enzyme site and the gene-specific sequence in order to establish the correct 
reading frame of the fusion gene and/or a translation initiation consensus sequence. Lastly, 
the primers always contain a few nucleotides in front of the restriction enzyme site to allow 
efficient digestion with the enzyme. 



15 -Identifying a source of the gene to be amplified. In order for a PGR reaction to produce a 
product with gene-specific primers, the gene-sequence must initially be present in the reacti- 
on, e.g. in the form of cDNA. Information in GenBank or the scientific literature will usually 
indicate in which tissue(s) the gene is expressed, and cDNA libraries from a great variety of 
tissues or cell types from various species are commercially available, e.g. from Clontech 

20 (Palo Alto), Stratagene (La Jolla) and Invitrogen (San Diego). Many genes are also available 
in cloned form from The American Type Tissue Collection (Virginia). 



- Optimizing the PGR reaction. Several factors are known to influence the efficiency and 
specificity of a PGR reaction, including the annealing temperature of the primers, the con- 
25 centration of ions, notably Mg^* and K', present in the reaction, as well as pH of the reaction, 
if the result of a PGR reaction is deemed unsatisfactory, it might be because the parameters 
mentioned above are not optimal. Various annealing temperatures should h^^ tonfo^ 

PGR mRrhinn vv-^^ - I 
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- Cloning the PGR product. The vector into which the amplified gene product will be cloned 
and fused with GFP will already have been taken into consideration when the primers were 
designed. When choosing a vector, one should at least consider in which cell types the pro- 
be subsequently will be expressed, so that the promoter controlling expression of the probe 

5 is compatible with the cells. Most expression vectors also contain one or more selective 
markers, e.g. confemng resistance to a drug, which is a useful feature when one wants to 
make stable transfectants. The selective marker should also be compatible with the cells to 
be used. 

10 The actual cloning of the PGR product should present no difficulty as it typically will be a 
one-step cloning of a fragment digested with two different restriction enzymes into a vector 
digested with the same two enzymes. If the cloning proves to be problematic, it may be be- 
cause the restriction enzymes did not work well with the PGR fragment. In this case one 
could add longer extensions to the end of the primers to overcome a possible difficulty of di- 

15 gestion close to a fragment end, or one could introduce an intermediate cloning step not ba- 
sed on restriction enzyme digestion. Several companies offer systems for this approach, e.g. 
Invitrogen (San Diego) and Clontech (Palo Alto). 

Once the gene has been cloned and, in the process, fused with the GFP gene, the resulting 
20 product, usually a plasmid, should be carefully checked to make sure it is as expected. The 
most exact test would be to obtain the nucleotide sequence of the fusion-gene. 

Testing the probe 

Once a DNA construct for a probe has been generated, its functionality and usefulness may 
25 be tested by subjecting it to the following tests: 

- Transfecting it into cpH^ ^^nr^-^^'- ' 
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The intensity should usually be at least as strong as that of unfused GFP in the cells. If it is 
not, the sequence or quality of the probe-DNA might be faulty, and should be carefully 
checked. 

5 The sub-cellular localization is an indication of whether the probe is likely to perform well, If it 
localizes as expected for the gene in question, e.g. is excluded from the nucleus, it can im- 
mediately go on to a functional test. If the probe is not localized soon after the transfection 
procedure, it may be because of overexpression at this point in time, as the cell typically will 
have taken of very many copies of the plasmid, and localization will occur in time, e.g. within 

10 a few weeks, as plasmid copy number and expression level decreases. If localization does 
not occur after prolonged time, it may be because the fusion to GFP has destroyed a locali- 
zation function, e.g. masked a protein sequence essential for interaction with its normal cel- 
lular anchor-protein. In this case the opposite fusion might work, e.g. if GeneX-GFP does not 
work, GFP-GeneX might, as two different parts of GeneX will be affected by the proximity to 

1 5 GFP. If this does not work, the proximity of GFP at either end might be a problem, and it 

could be attempted to increase the distance by incorporating a longer linker between GeneX 
and GFP in the DNA construct. 

If there is no prior knowledge of localization, and no localization is observed, it may be be- 
cause the probe should not be localized at this point, because such is the nature of the pro- 
20 tein fused to GFP. It should then be subjected to a functional test. 

In a functional test, the cells expressing the probe are treated with at least one compound 
known to perturb, usually by activating, the signalling pathway on which the probe is expec- 
ted to report by redistributing itself within the cell. If the redistribution is as expected, e.g. if 
prior knowledge tell that it should translocate from location X to location Y, it has passed the 
25 first critical test. In this case it can go on to further characterization and quantification of the 
response. 

If it does not perform as expected, it may be because the cell lacks at least one comnonent 
of the signalling pathwav. e 0 n^^''^'"-^" - 

^.;:cj::::cjo o'lo sncL^in irientiiv c:^ei ceii lypes tor the testing process 
/vfierc tnose potential problems would not apply. 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 



PCT/DK98/00145 



17 

If there is no prior knowledge about the pattern of redistribution, the analysis of the redistri- 
bution will have to be done in greater depth to identify what the essential and indicative fea- 
tures are, and when this is clear, it can go on to further characterization and quantification of 
the response. If no feature of redistribution can be identified, the problem might be as menti- 
5 oned above, and the probe should be retested under more optimal cellular conditions. 

If the probe does not perform under optimal cellular conditions it's back to the drawing board. 



Deveioping an image-based assay technique 

The process of developing an image-based redistribution assay begins with either the un- 
10 planned experimental observation that a redistribution phenomenon can be visualised, or the 
design of a probe specifically to follow a redistribution phenomenon already known to occur 
In either event, the first and best exploratory technique Is for a trained scientist or technician 
to observe the phenomenon. Even with the rapid advances in computing technology, the 
human eye-brain combination is still the most powerful pattern recognition system known, 
15 and requires no advance knowledge of the system in order to detect potentially interesting 
and useful patterns in raw data. This is especially if those data are presented in the form of 
images, which are the natural "data type" for human visual processing. Because human vi- 
sual processing operates most effectively in a relatively narrow frequency range, i.e., we 
cannot see either very fast or very slow changes in our visual field, it may be necessary to 
20 record the data and play it back with either time dilation or time compression. 

Some tuminescence phenomena cannot be seen directly by the human eye. Examples in- 
clude polarization and fluorescence lifetime. However, with suitable filters or detectors, these 
signals can be recorded as images or sequences of images and displayed to the human in 
the fashion just described. In this way, patterns can be detected and the same methods can 
25 be applied. 

Once the redistribition has been determined to be a reproducible phenomenon, one or more 
data sets are generated for the purpose of developing a procedure ov^m * • * 

.s:>. .■ '^'av DO npr-pccnrw * - ."eveiop a quantitative procedure m order to assess the effect 
on the assay of manipulating the assay conditions, 
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The data sets are examined by a person or persons with knowledge of the biological phe- 
nomenon and skill in the application of image processing techniques. The goal of this exerci- 
se is to determine or at least propose a method which will reduce the image or sequence of 
images constituting the record of a "response" to a value corresponding to the degree of the 
5 response. Using either interactive image processing software or an image processing tool- 
box and a programming language, the method is encoded as a procedure or algorithm which 
takes the image or images as input and generates the degree of response (in any units) as 
its output. Some of the criteria for evaluating the validify of a particular procedure are: 

• Does the degree of the response vary in a biologically significant fashion, i.e., does it 
10 show the known or putative dependence on the concentration of the stimulating 

agent or condition? 

• is the degree of response reproducible, i.e., does the same concentration or level of 
stimulating agent or condition give the same response with an acceptable variance? 

• Is the dynamic range of the response sufficient for the purpose of the assay? If not, 
15 can a change in the procedure or one of its parameters improve the dynamic range? 

• Does the procedure exhibit any clear "pathologies", i.e., does it give ridiculous values 
for the response if there are commonly occurring imperfections in the imaging pro- 
cess? Can these pathologies be eliminated, controlled, or accounted for? 

• Can the procedure deal with the normal variation in the number and/or size of cells in 
20 an image? 

In some cases the method may be obvious; in others, a number of possible procedures may 
suggest themselves. Even if one method appears clearly superior to others, optimisation of 
parameters may be required. The various procedures are applied to the data set and the 
25 criteria suggested above are determined, or the single procedure is applied repeatedly with 
adjustment of the parameter or parameters until the most satisfactory combination of signal, 
noise, range, etc. are arrived at. This is eauivaient to +ho ^nt;t--nt-. - 

... ..:r;c: .'rj.^.e irr^r ic:ge ait; exirGmeiy large, and 

' :;i> cii: ;nie!iigeni approach must be taken to the initial step of reducing this number to a 
small, finite number of possible procedures. This is not to say that the oroced'-r^i n''^'\ ^-i 
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necessarily the best procedure - but a global search for the best procedure is simply out of 
the question due to the sheer number of possibilities involved. 

Image-based assays are no different than other assay techniques in that their usefulness is 
characterised by parameters such as the specificity for the desired component of the 
5 sample, the dynamic range, the variance, the sensitivity, the concentration range over which 
the assay will work, and other such parameters. While it is not necessary to characterise e- 
ach and every one of these before using the assay, they represent the only way to compare 
one assay with another. 



10 Example: Developfno a Quantitative assav for GLUT4 Translocation 

GLUT4 is a member of the class of glucose transporter molecules which are important in 
cellular glucose uptake, it is known to translocate to the plasma membrane under some 
conditions of stimulation of glucose uptake. The ability to visualize the glucose uptake re- 
sponse noninvasively, without actually measuring glucose uptake, would be a very useful 
15 assay for anyone looking for, for example, treatments for type !l diabetes. 

A CHO ceil line which stably expressed the human insulin receptor was used as the basis for 
a new cell line which stably expressed a fusion between GLUT4 and GFP. This cell line was 
expected to show translocation of GLUT4 to the plasma membrane as visualized by the 
movement of the GFP. The translocation could definitely be seen in the form of the appea- 
20 ranee of local increases in the fluorescence in regions of the plasma membrane which had a 
characteristic shape or pattern. This is shown in Figure 12. 

These objects became known as "snircles", and the phenomenon of their appearance as 
"snircling". In order to quantitate their appearance, a method had to be found to isolate them 
as objects in the image field, and then enumerate them, measure their area, or determine 
25 some parameter about them which correlated In a dose-dependent fashion with the concen- 
tration of insulin to which the cells had been exposed. In order to separate the snircles, a bi- 
narization procedure was applied in which one copy of the imaae smoothed w'^^ ^ '-^^ 

d;e'j ;is :T:;n max rnnc^e i:--: an auiomat.c ttuesnold was applied to divide the image into 
two levels. The thresholded image contains a background of one value all found object with 
another value. The found objects were first fiUored thrn:;r;h a fi't-?^ " -f.-.n^ ; .c ---r.- v,- - 
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large and far too small to be snircles. The remaining objects, which represent snircles and 
other artifacts from the image with approximately the same size and intensity characteristics 
as snircles, are passed into a classification procedure which has been previously trained with 
many images of snircles to recognize snircles and exclude the other artifacts. The result of 
5 this procedure is a binary image which shows only the found snircles to the degree to which 
the classification procedure can accurately identify them. The total area of the snircles is 
then summed and this value is the quantitative measure of the degree of snircling for that 
image. 



10 Definitions: 

In the present specification and claims, the term "an influence" covers any influence to which 
the cellular response comprises a redistribution. Thus, e.g., heating, cooling, high pressure, 
low pressure, humidifying, or drying are influences on the cellular response on which the re- 
sulting redistribution can be quantified, but as mentioned above, perhaps the most important 
15 influences are the influences of contacting or incubating the cell or cells with substances 

which are known or suspected to exert and influence on the cellular response involving a re- 
distribution contribution. In another embodiment of the invention the influence could be sub- 
stances from a compound drug library. 



20 In the present context, the term "green fluorescent protein" is intended to indicate a protein 
which, when expressed by a cell, emits fluorescence upon exposure to light of the correct 
excitation wavelength (cf. [(Chalfie ef a/.1994)]). In the following, GFP in which one or more 
amino acids have been substituted, inserted or deleted is most often termed "modified GFP". 
"GFP" as used herein includes wild-type GFP derived from the jelly fish Aequorea victoria 

25 and modifications of GFP, such as the blue fluorescent variant of GFP disclosed by Heim et 
al. (1994). Proc.Natl.Acad.Sci. 91:12501, and other modifications that change the spectral 
properties of the GFP fluorescence, or modifications that exhibit increased fluorescennn 
when expressed in nell'^ n ^nr^^n.^-^ 

... . ...oiuscc:;: :roiein nf^nvo:: t^cti A^^quured tureen hiuorescent Protein 

, GFP; or any functional analogue thereof, wherein the amino acid in position 1 upstream from 
the chromophore has been mutated to provide an increase of fluoresc'^^^^ .v^^^ '^ " ■ 
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fluorescent protein of the invention is expressed in ceils. Preferred GFP variants are F64L- 
GFP, F64L-Y66H-GFP and F64L-S65T-GFP. An especially preferred variant of GFP for use 
in all the aspects of this invention Is EGFP (DNA encoding EGFP which is a F64L-S65T va- 
riant with codons optimized for expression in mammatian cells is available from Clontech, 
5 Palo Alto, plasmids containing the EGFP DNA sequence, cf. GenBank Acc. Nos. U55762, 
U55763). 

The ternn "intracsllular signaiiing pathway" and "signal transduction pathway" are intended to 
indicate the coordinated intracellular processes whereby a living cell transduce an external 
10 or internal signal into cellular responses. Said signal transduction will involve an enzymatic 
reaction said enzymes include but are not limited to protein kinases, GTPases, ATPases, 
protein phosphatases, phospholipases. The cellular responses include but are not limited to 
gene transcription, secretion, proliferation, mechanical activity, metabolic activity, cell death. 

15 The term "second messenger" is used to indicate a low molecular Vveight component invol- 
ved in the eariy events of intracellular signal transduction pathways. 

The term "luminophore" is used to indicate a chemical substance which has the property of 
emitting light either inherently or upon stimulation with chemical or physical means. This in- 
cludes but is not limited to fluorescence, bioluminescence, phosphorescence, chemilumine- 
20 scence. 

The term "mechanically intact living cell" is used to indicate a cell which is considered living 
according to standard criteria for that particular type of cell such as maintenance of normal 
membrane potential, energy metabolism, proliferative capability, and has not experienced 
25 any physically invasive treatment designed to introduce external substances into the cell 
such as microinjection. 

^Mraceiiuiar comnn^^o' / - rredb./ecj oy a cnange in the luminescence properties 
or distribution, is used to indicate that said redistribution can be explained in terms of the un- 
derlying biological phenomenon which gives rise tn roHir^riu, 
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Th terms "image processing" and "image analysis" are used to describe a large family of di- 
gital data analysis techniques or combination of such techniques which reduce ordered ar- 
rays of numbers (images) to quantitative information describing those ordered arrays of 
numbers. When said ordered arrays of numbers represent measured values from a physical 
5 process, the quantitative information derived is therefore a measure of the physical process. 

The term "fluorescent probe" is used to indicate a fluorescent fusion polypeptide com.prising 
a GFP or any functional part thereof which is N- or C-terminally fused to a biologically active 
polypeptide as defined herein, optionally via a peptide linker consisting of one or more amino 
10 acid residues, where the size of the linker peptide In itself is not critical as long as the desi- 
red functionality of the fluorescent probe is maintained. A fluorescent probe according to the 
invention is expressed in a cell and basically mimics the physiological behaviour of the biolo- 
gically active polypeptide moiety of the fusion polypeptide. 

15 The term "mammalian cell" is intended to indicate any living cell of mam.malian origin. The 
cell may be an established cell line, many of which are available from The American Type 
Culture Collection (ATCC, Virginia, USA) or a primary cell with a limited life span derived 
from a mammalian tissue, including tissues derived from a transgenic animal, or a newly 
established immortal cell line derived froma mammalian tissue including transgenic tissues, 

20 or a hybrid cell or cell line derived by fusing different celltypes of mammalian origin e.g. hy- 
bridoma cell lines. The cells may optionally express one or more non-native gene products, 
e.g. receptors, enzymes, enzyme substrates, prior to or in addition to the fluorescent probe. 
Preferred cell lines include but are not limited to those of fibroblast origin, e.g. BHK, CHO, 
BALB, or of endothelial origin, e.g. HUVEC. BAE (bovine artery endothelial), CPAE (cow 

25 pulmonary artery endothelial) or of pancreatic origin, e.g. RIN, lNS-1 , MIN6, bTC3, aTC6, 
bTC6, HIT, or of hematopoietic origin, e.g. adipocyte origin, e.g. 3T3-L1 , neuro- 
nal/neuroendocrine origin, e.g. AtT20, PCI 2, GH3, muscle origin, e.g. SKMC, A10, C2C12, 
renal origin, e.g. HEK 293, LLC-PK1. 

. ..:..:e.:.o : , :'vs case p.^. 'eas! a conioii oi :ne green liuorescent 
, .uioiM, 3na at least a portion of a catalytic and/or regulatory domain of a protein kinase. 
Furthermore a hybrid polypeptide Is intended to indicate a fusion polvoeDtide -'-^^r^'z'r^ • 
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GFP or at least a portion of the green fluorescent protein that contains a functional ftuorop- 
hore, and at least a portion of a biologically active polypeptide as defined herein provided 
that said fusion is not the PKCa-GFP, PKCy-GFP, and PKCs-GFP disclosed by Schmidt et 
al.and Sakai et a!., respectively. Thus, GFP may be N- or C-terminally tagged to a biologi- 
5 cally active polypeptide, optionally via a linker portion or linker peptide consisting of a sequ- 
ence of one or more amino acids. The hybrid polypeptide or fusion polypeptide may act as a 
fluorescent probe in intact living cells carrying a DNA sequence encoding the hybrid poly- 
peptide under conditions permitting expression of said hybrid polypeptide. 

10 The term "kinase" is intended to indicate an enzyme that is capable of phosphorylating a 
cellular component. 

The term "protein kinase" is intended to indicate an enzyme that is capable of phosphoryla- 
ting serine and/or threonine and/or tyrosine in peptides and/or proteins. 

15 The term "phosphatase" is intended to indicate an enzyme that is capable of dephosphoryla- 
ting phosphoserine and/or phosphothreonine and/or phosphotyrosine in peptides and/or 
proteins. 

In the present context, the term "biologicaily active polypeptide" is intended to indicate a po- 
20 lypeptide affecting intracellular processes upon activation, such as an enzyme which is acti- 
ve in intracellular processes or a portion thereof comprising a desired amino acid sequence 
which has a biological function or exerts a biological effect in a cellular system. In the poly- 
peptide one or several aminoacids may have been deleted, inserted or replaced to alter its 
biological function, e.g. by rendering a catalytic site inactive. Preferably, the biologically acti- 
25 ve polypeptide is selected from the group consisting of proteins taking part in an intracellular 
signalling pathway, such as enzymes involved in the intracellular phosphorylation and dep- 
hosphorylation processes including kinases, protein i^inqcor -^^j .a , 

■ ' ■ ^ :-^ea^-,:r-i: : V..: :':G!or:irai!v :"3r::;'/6 puiypeptide 
-^eif: f.'orp prr.t^rac.v, \:\e LiOiogicaiiy active polypeptide is a protein which according to its 
state as activated or non-activated changes localisation within the cell, preferably as an in- 
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termediary component in a signal transduction pathway. Included in this preferred group of 
bioiogically active polypeptides are cAMP dependent protein kinase A. 

The term "a substance having biological activity" is intended to indicate any sample which 
5 has a biological function or exerts a biological effect in a cellular system. The sample may be 
a sample of a biological material such as a sample of a body fluid including blood, plasma, 
saliva, milk, urine, or a microbial or plant extract, an environmental sample containing pollu- 
tants including heavy metals or toxins, or it may be a sample containing a compound or 
mixture of compounds prepared by organic synthesis or genetic techniques. 

10 

The phrase "any change in fluorescence" means any change in absorption properties, such 
as wavelength and intensity, or any change in spectral properties of the emitted light, such 
as a change of wavelength, fluorescence lifetime, intensity or polarisation, or any change in 
the intracellular localisation of the fluorophore. It may thus be localised to a specific cellular 
15 component (e.g. organelle, membrane, cytoskeleton, molecular structure) or it may be 
evenly distributed throughout the cell or parts of the cell. 

The phrase "back-tracking of a signal transduction pathway" is intended to indicate. 

The term "organism" as used herein indicates any unicellular or multicellular organism prefe- 
rably originating from the animal kingdom including protozoans, but also organisms that are 
20 members of the plant kingdoms, such as algae, fungi, bryophytes, and vascular plants are 
included in this definition. 

The term "nucleic acid" is intended to indicate any type of poly- or oligonucleic acid sequen- 
ce, such as a DNA sequence, a cDNA sequence, or an RNA sequence. 

25 

The term "biologically equivalent" as it relates to orotein^ i<^^ ir^+nnrin-^ * 

. ■..;:cj:ea i5ororp^s enrocea ::'\' c: :u.:- 
■ i aeries. ■! tnpv r^^e so;;cir^ .anarns, cr allelic variants derived from the same gene, if 
30 they perform identical cellular functions in different cell types, or in different species. The 
term "biologically equivalent" as it --R'ate^ t-^ DNA r->-^— c.-^ ^>,r. >- y . t r. • ^ .... ; 
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ence encoding a polypeptide is equivalent to a second DNA sequence encoding a polypepti- 
de if the functional proteins encoded by the two genes are biologically equivalent. 

The phrase "back-tracking of a signal transduction pathway" is intended to indicate a process 
5 for defining more precisely at what level a signal transduction pathway is affected, either by 
the influence of chennicai compounds or a disease state in an organism. Consider a specific 
signal transduction pathway represented by the bioactive polypeptides A - B - C - n, with 
signal transduction from A towards D. When investigating all components of this signal 
transduction pathway compounds or disease states that influence the activity or redistribu- 
10 tion of only D can be considered to act on C or downstream of C whereas compounds or 
disease states that influence the activity or redistribution of C and D, but not of A and B can 
be considered to act downstream of B. 

The term "fixed cells" is used to mean cells treated with a cytological fixative such as glu- 
15 taraldehyde or formaldehyde, treatments which serve to chemically cross-link and stabilize 
soluble and insoluble proteins within the structure of the cell. Once in this state, such pro- 
teins cannot be lost from the structure of the now-dead cell. 

20 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 . CHO cells expressing the PKAc-F64L-S65T-GFP hybrid protein have been treated 
in HAM'S F12 medium with 50 mM forskolin at 37°C. The images of the GFP fluorescence in 
25 these cells have been taken at different time intervals after treatment, which were: a) 40 se- 
conds b) 60 seconds c) 70 seconds d) 80 seconds. The fluores^pr^-^ ^ 
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Figure 2. Time-lapse analysis of forskolin induced PKAc-F64L-S65T-GFP redistribution. 
CHO cells, expressing the PKAc-F64L-S65T-GFP fusion protein were analysed by time- 
lapse fluorescence microscopy. Fluorescence micrographs were acquired at regular intervals 
from 2 min before to 8 min after the addition of agonist. The cells were challenged with 1 mM 
5 forskolin immediately after the upper left image was acquired (t=0). Frames were collected at 
the following times: i) 0, ii) 1, iii) 2, iv) 3, v) 4 and vi) 5 minutes. Scale bar 10 mm. 



Figure 3. Time-lapse analyses of PKAc-F64L-S65T-GFP redistribution in response to various 

10 agonists. The effects of 1 mM forskolin (A), 50 mM forskolin (B), 1mM dbcAMP (C) and 100 
mM IBMX (D) (additions indicated by open arrows) on the localisation of the PKAc-F64L- 
S65T-GFP fusion protein were analysed by time-lapse fluorescence microscopy of 
CHO/PKAC-F64L-S65T-GFP cells. The effect of addition of 10 mM forskolin (open arrow), 
followed shortly by repeated washing with buffer {solid arrow), on the localisation of the 

15 PKAC-F64L-S65T-GFP fusion protein was analysed in the same cells (E). In a parallel expe- 
riment, the effect of adding 10 mM forskolin and 100 mM IBMX (open arrow) followed by re- 
peated washing with buffer containing 100 mM IBMX (solid arrow) was analysed (F). Remo- 
ving forskolin caused PKAc-F64L-S65T-GFP fusion protein to return to the cytoplasmic ag- 
gregates while this is prevented by the continued presence of IBMX (F). The effect of 100 

20 nM glucagon (Fig 3G, open arrow) on the localisation of the PKAc-F64L-S65T-GFP fusion 
protein is also shown for BHK/GR, PKAc-F64L-S65T-GFP cells. The effect of 10 mM norepi- 
nephhne (H), solid arrow, on the localisation of the PKAc-F64L-S65T-GFP fusion protein 
was analysed similarly, in transiently transfected CHO, PKAc-F64L-S65T-GFP cells, pretre- 
ated with 10 mM forskolin, open arrow, to increase [cAMP];. N.B. in Fig 3H the x-axis counts 

25 the image numbers, with 12 seconds between images. The raw data of each experiment 
consisted of 60 fluorescence micrographs acquired at regular intervals including several 
images acquired before the addition of buffer or agonist. The charts (A-G) each show a 
quantification of the response seen through all the 60 imaaes nerp^rmnr^ ^ 
analysis mpthn^^ 

. :r3:}n:-- ■ '^-'m^t^ 
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Figure 4. Dose response curve (two experiments) for forskolin-induced redistribution of the 
PKAC-F64L-S65T-GFP fusion. 

5 

Figure 5. Time from initiation of a response to half maximal (ti^jmax) and maximal (t^^ J PKAc- 
F64L-S65T-GFP redistribution. The data was extracted from curves such as that shown in 
"Figure 2." All ti/2max and X^^^ values are given as meaniSD and are based on a total of 26-30 
cells from 2-3 independent experiments for each forskolin concentration. Since the observed 
10 redistribution is sustained over time, the t^^, values were taken as the earliest time point at 
which complete redistribution is reached. Note that the values do not relate to the degree of 
redistribution. 



15 Figure 6. Parallel dose response analyses of forskolin induced cAMP elevation and PKAc- 
F64L-S65T-GFP redistribution. The effects of buffer or 5 increasing concentrations of forsko- 
lin on the localisation of the PKAc-F64L-S65T-GFP fusion protein in CHO/PKAc-F64L-S65T- 
GFP cells, grown in a 96 well plate, were analysed as described above. Computing the ratio 
of the SD's of fluorescence micrographs taken of the same field of cells, prior to and 30 min 

20 after the addition of forskolin, gave a reproducible measure of PKAc-F64L-S65T-GFP redi- 
stribution. The graph shows the individual 48 measurements and a trace of their me- 
an±s.e.m at each forskolin concentration. For comparison, the effects of buffer or 8 increa- 
sing concentrations of forskolin on [cAMP]i was analysed by a scintillation proximity assay of 
cells grown under the same conditions. The graph shows a trace of the mean ± s.e.m of 4 

25 experiments expressed in arbitrary units. 



Figure 7. BHK cells stah'v t-n--v * 

- nriS'er-' 

^ ' .J'.:. ; G^-- *-c: v.yiopiasm lo the plasma membrane, Ima- 
.-s were taKen at tne following times: a) 1 second before carbachol addition, b) 8.8 seconds 
after addition and c) 52.8 seconds after addition. 
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Figure 8. BHK cells stably transfected with the hM1 receptor and PKCa-F64L-S65T-GFP 
fusion were treated with carbachol (1 mM, 10 mM, 100 mM). In single cells intracellular 
[Ca^T was monitored simultaneously with the redistribution of PKCa-F64L-S65T-GFP. Da- 
5 shed line indicates the addition times of carbachol. The top panel shows changes in the in- 
tracellular Ca^* concentration of individual cells with time for each treatment. The middle pa- 
nel shows changes in the average cytoplasmic GFP fluorescence for individual cells against 
time for each treatment. The bottom pane! shows changes in the fluorescence of the perip- 
hery of single cells, within regions that specifically include the circumferential edge of a cell 
10 as seen in normal projection, the regions which offers best chance to monitor changes in the 
fluorescence intensity of the plasma membrane. 

Figure 9. a) The hERK1-F64L-S65T-GFP fusion expressed in HEK293 cells treated with 100 
mM of the MEK1 inhibitor PD98059 in HAM F-12 (without serum) for 30 minutes at 37 °C. 
1 5 The nuclei empty of fluorescence during this treatment. 

b) The same cells as in (a) following treatment with 10 % foetal calf serum for 15 minutes at 
37 X. 

c) Time profiles for the redistribution of GFP fluorescence in HEK293 cells following treat- 
ment with various concentrations of EGF in Hepes buffer (HAM F-12 replaced with Hepes 

20 buffer directly before the experiment). Redistribution of fluorescence is expressed as the 
change in the ratio value between areas in nucleus and cytoplasm of single cells. Each time 
profile is the mean for the changes seen in six single cells. 

d) Bar chart for the end-point measurements, 600 seconds after start of EGF treatments, of 
fluorescence change (nucleusxytoplasm) following various concentrations of EGF. 

25 

Figure 10. 

a) The SMAD2-EGFP fn^-- r-v^^^ 



b) HEK 293 cells expressing the SMAD2-EGFP fusion were treated with various concentra- 
tion of TGF-beta as indicated, and the redist^'^ii^ir^ - . 
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The time profile plots represent increases in fluorescence within the nucleus, normalised to 
starting values in each cell measured. Each trace is the time profile for a single cell nucleus. 

c) A bar chart representing the end-point change in fluorescence within nuclei (after 850 se- 
conds of treatment) for different concentrations of TGF-beta. Each bar is the value for a 
5 single nucleus in each treatment. 

Figure 11. The VASP-F64L-S65T-GFP fusion in CHO ceils stably transfected with the hu- 
man insulin receptor. The cells were starved for two hours in HAM F-12 without serum, then 
treated with 10% foetal calf serum. The image shows the resulting redistribution of fluore- 
10 scence after 15 minutes of treatment. GFP fluorescence becomes localised in structures 
identified as focal adhesions along the length of actin stress fibres. 

Figure 12, Time lapse recording GLUT4-GFP redistribution in CHO-HiR cells. Time indicates 
minutes after the addition of 100 nM insulin. 
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EXAMPLE 1 

5 Construction, testing and implementation of an assay for cAMP based on PKA activation in 
real time within living cells. 

Useful for monitoring the activity of signalling pathways which lead to altered concentrations 
of cAMP, e.g. activation of G-protein coupled receptors which couple to G-proteins of the Gs 
or G| class. 

10 

The catalytic subunit of the murine cAMP dependent protein kinase (PKAc)was fused C- 
terminally to a F64L-S65T derivative of GFP. The resulting fusion (PKAc-F64L-S65T-GFP) 
was used for monitoring in vivo the translocation and thereby the activation of PKA. 

Construction of the PKAc-F64L-S65T-GFP fusion: 

15 Convenient restriction endonuclease sites were introduced into the cDNAs encoding murine 
PKAc (Gen Bank Accession number: Ml 2303) and F64L-S65T-GFP (sequence disclosed in 
WO 97/11094) by polymerase chain reaction (PCR). The PGR reactions were performed ac- 
cording to standard protocols with the following primers: 

5'PKAc: TTggACACAAgCTTTggACACCCTCAggATATgggCAACgCCgCCgCCgCCAAg 
20 (SEQ ID N0:3), 

3'PKAc: gTCATCTTCTCgAgTCTTTCAggCgCgCCCAAACTCAgTAAACTCCTTgCCACAC 
(SEQ ID N0:4) . 

5'GFP: TTqnACAnAAnr'nn--^A'^A'^^--^'-^ ■ 

' "■'■■C " 'J \.AAyy i ' 1 ^ ! I MU 1 i CA I CCA 1 gCCA i y'T (SEQ ID 

NO:2). 
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The PKAc amplification product was then digested with Hindi Il+Asci and the F64L-S65T- 
GFP product with Ascl+Xhol. The two digested PGR products were subsequently ligated 
with a Hindlll+Xhol digested plasmid (pZeoSV® mammalian expression vector. Invitrogen, 
San Diego, CA, USA). The resulting fusion construct (SEQ ID NO:68 & 69) was under con- 
5 trol of the SV40 promoter. 

Transfection and cell culture conditions. 

Chinese hamster ovary cells (CHO), were transferted with the plasmid containing the PKAc- 
F64L-S65T-GFP fusion using the calcium phosphate precipitate method in HEPES-buffered 
saline (Sambrook etaL, 1989). Stable transfectants were selected using 1000 mg Zeocin/mi 

10 (Invitrogen) in the growth medium (DMEM with 1000 mg giucose/l, 10 % fetal bovine serum 
(PBS), 100 mg penicillin-streptomycin mixture ml-\ 2 mM L-glutamine purchased from Life 
Technologies inc., Gaithersburg, MD, USA). Untransfected CHO cells were used as the 
control. To assess the effect of glucagon on fusion protein translocation, the PKAc-F64L- 
S65T-GFP fusion was stably expressed in baby hamster kidney cells overexpressing the 

15 human glucagon receptor (BHK/GR cells) Untransfected BHK/GR cells were used as the 
control. Expression of GR was maintained with 500 mg G418/ml (A/eo marker) andPKAc- 
F64L-S65T-GFP was maintained with 500 mg Zeocln/ml {Sh ble marker). CHO cells were 
also simultaneously co-transfected with vectors containing the PKAc-F64L-S65T-GFP fusion 
and the human a2a adrenoceptor (hARa2a). 

20 For fluorescence microscopy, cells were allowed to adhere to Lab-Tek chambered cover- 
glasses (Nalge Nunc Int., Napervilie, IL, USA) for at least 24 hours and cultured to about 
80% confluence. Prior to experiments, the cells were cultured over night without selection 
pressure in HAM F-12 medium with glutamax (Life Technologies), 100 mg penicillin- 
streptomycin mixture ml'^ and 0.3 % FBS. This medium has low autofluorescence enabling 

25 fluorescence microscopy of cells straight from the incubator. 

'CGccpe ti:tea vviii. u \-,-../d\ -+0a, i\m, i,o on immersion oDjective ana coupled to a Photomet- 
30 rics CH250 charged coupled device (CCD) camera. The cells were illuminated with a 100 W 
HBO arc tamp. In the light path was h 470±20 nm ev-i*aticn "^i'^o^ n d^^^^n'^ ^^sr.^. 
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and a 515±15 nm emission filter for minimal Image background. The cells were kept and 
monitored to be at 37X with a custom built stage heater. 

Images were processed and analyzed in the following manner:. 

Method 1: Stepwise procedure for quantitation of translocation of PKA: 

5 1 . The image was corrected for dark cun-ent by performing a pixel-by-pixel subtraction of a 
dark image (an image taken under the same conditions as the actual image, except the 
camera shutter is not allowed to open). 

2. The image was corrected for non-uniformity of the illumination by performing a pixel-by- 
pixel ratio with a flat field correction image (an image taken under the same conditions as the 

10 actual image of a uniformly fluorescent specimen). 

3. The image histogram, i.e., the frequency of occurrence of each intensity value in the im- 
age, was calculated. 

4. A smoothed, second derivative of the histogram was calculated and the second zero is 
determined. This zero corresponds to the inflection point of the histogram on the high side of 

15 the main peak representing the bulk of the image pixel values. 

5. The value determined in step 4 was subtracted from the image. All negative values were 
discarded. 

6. The variance (square of the standard deviation) of the remaining pixel values was deter- 
mined. This value represents the "response" for that image. 

20 7. Scintillation proximity assay (SPA) for independent quantitation of cAMP: 
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Method 2: Alternative method for quantitation of PKA redistribution: 

1. The fluorescent aggregates are segmented from each image using an automatically 

5 found threshold based on the maximisation of the information measure between the object 
and background. The a priori entropy of the image histogram is used as the information me- 
asure. 

2. The area of each image occupied by the aggregates is calculated by counting pixels in 
the segmented areas. 

10 3. The value obtained in step 2 for each image in a series, or treatment pair, is normalised to 
the value found for the first (unstimulated) image collected. A value of zero (0) Indicates no 
redistribution of fluorescence from the starting condition. A value of one (1) by this method 
equals full redistribution. 

15 Cells were cultured in HAM F-12 medium as described above, but in 96-well plates. The me- 
dium was exchanged with Ca^'-HEPES buffer including 100 mM IBMX and the cells were 
stimulated with different concentrations of forskolin for 10 min. Reactions were stopped with 
addition of NaOH to 0.14 M and the amount of cAMP produced was measured with the 
cAMP-SPA kit. RPA538 (Amersham) as described by the manufacturer. 

20 

Manipulating intracellular levels of cAMP to test the PKAc-F64L-S65T-GFP fusion. 

The following compounds were used to vary cAMP levels: Forskolin, an activator of adeny- 
late cyclase; dbcAMP, a membrane permeable cAMP analog which is not degraded by 
phosphodiesterase; IBMX, an inhibitor of phosphodiesterase. 

25 CHO cells stably expressing the PKAc-F64L-S65T-GFP. showed a dramatic translocation of 
Fig. 2 shows the progression of response in time following treatment with 1 mM forskolin. 
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Fig. 3 gives a comparison of the average temporal profiles of fusion protein redistribution 
and a measure of the extent of each response to the three forskolin concentrations (Fig. 3A, 
E, B), and to 1 mM dbcAMP (fig 3C) which caused a similar but slower response, and to ad- 
dition of 100 mM IBMX (n=4, Fig. 3D) which also caused a slow response, even in the ab- 
5 sence of adenylate cyclase stimulation. Addition of buffer (n=2) had no effect (data not 
shown). 

As a control for the behavior of the fusion protein, F64L-S65T-GFP alone was expressed in 
CHO cells and these were also given 50 mM forskolin (n=5); the uniform diffuse distribution 
characteristic of GFP in these cells was unaffected by such treatment (data not shown). 

10 The forskolin induced translocation of PKAc-F64L-S65T-GFP showed a dose-response re- 
lationship (Fig 4 and 6), see quantitative procedures above. 

Reversibility of PKAc-F64L-S65T-GFP translocation. 

The release of the PKAc probe from its cytoplasmic anchoring hotspots was reversible. 
15 Washing the ceils repeatedly (5-8 times) with buffer after lOpM forskolin treatment comple- 
tely restored the punctate pattern within 2-5 min (n=2, Fig. 3E). In fact the fusion protein re- 
turned to a pattern of fluorescent cytoplasmic aggregates virtually indistinguishable from that 
observed before forskolin stimulation. 

To test whether the return of fusion protein to the cytoplasmic aggregates reflected a decre- 
20 ased [cAMP],, cells were treated with a combination of 10 mM forskolin and 100 mM IBMX 
(n=2) then washed repeatedly (5-8 times) with buffer containing 100 mM IBMX (Fig, 3F), In 
these experiments, the fusion protein did not return to its prestimulatory localization after 
removal of forskolin. 

25 Testing the PKA-F64L-S65T-GFP probe with physiologically relevant agents. 

To test the probe's response to receptor activation of adenvla^p rvrla^p PHu- ^nM.. - ».,v i 

.loenyiate cyclase, thereby increasing the cAMP level. !n these cells, addition of 100 nM glu- 
30 cagon (n^2) caused the release of the PKA-F64L-S65T-GFP probe from the cytoplasmic 

Ron^eantes and a rpsiiit'nc ...f r..,, ^ ...... ^..^ .. 
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distribution within 2-3 min (Fig. 3G). Similar but less pronounced effects were seen at lower 
glucagon concentrations (n=2, data not shown). Addition of buffer (n=2) had no effect over 
time (data not shown). 

Transiently transfected CHO cells expressing hARa2a and the PKA-F64L-S65T-GFP probe 
5 were treated with 10 mM forskolin for 7.5 minutes, then, in the continued presence of 
forskolin, exposed to 10 mM norepinephrine to stimulate the exogenous adrenoreceptors, 
which couple to a G, protein, which Inhibit adenylate cyclase. This treatmpnt led to reappear 
ance of fluorescence in the cytoplasmic aggregates indicative of a decrease in [cAMP], (Fig. 
3H). 

10 

Fusion protein translocation correlated with [cAMPJj 

As described above, the time it took for a response to come to completion was dependent on 
the forskolin dose (Fig. 5) In addition the degree of responses was also dose dependent. To 
test the PKA-F64L-S65T-GFP fusion protein translocation in a semi high through-put sy- 

15 stem, CHO cells stably transfected with the PKA-F64L-S65T-GFP fusion was stimulated 
with buffer and 5 increasing doses of forskolin (n=8). Using the image analysis algorithm de- 
scribed above (Method 1), a dose response relationship was observed in the range from 
0.01-50 mM forskolin (Fig. 6). A half maximal stimulation was observed at about 2 mM for- 
skolin. In parallel, cells were stimulated with buffer and 8 increasing concentrations of for- 

20 skolin (n=4) in the range 0.01-50 mM. The amount of cAMP produced was measured in an 
SPA assay. A steep increase was observed between 1 and 5 mM forskolin coincident with 
the steepest part of the curve for fusion protein translocation (also Fig. 6) 



25 EXAMPLE 2 



Construction of PKC-GFP fusion: 
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The probe was constructed by ligating two restriction enzyme treated polymerase chain re- 
action (PGR) amplification products of the cDNA for murine PKCa (GenBank Accession 
number: M25811) and F64L-S65T-GFP (sequence disclosed in WO 97/11094) respectively. 
Taq® polymerase and the following oligonucleotide primers were used for PGR; 

5 5'mPKCa: TTggACAGAAgGTTTggACACGCTCAggATATggGTgACgTTTAGGCggCCAACg 
(SEQ ID N0:5), 

3'mPKCa: gTCATCTTCTCgAgTCmCAggCgCgGCCTACTgGACTTTgCAAgATTgggTgC 
(SEQ ID N0:6), 

5'F64L-S65T-GFP: TTggACACAAgGTTTggAGACggCgGgCGATgAgTAAAggAgAAgAACTT- 
10 TIC (SEQ ID NO: 1), 

3'F64L-S65T-GFP: gTGATCTTCTGgAgTCTTAGTCGTgAggTTTgTATAgTTGATCGATgG- 
CATgT (SEQ ID NO:2). 

The hybrid DNA strand was inserted into the pZeoSV® mammalian expression vector as a 
HIndlll-Xhol casette as described in example 1. 



15 Gell Culture: 

BHK cells expressing the human Ml receptor under the control of the inducible metal- 
lothionine promoter and maintained with the dihydrofolate reductase marker were trans- 
fected with the PKGa-F64L-S65T-GFP probe using the calcium phosphate precipitate 
method in HEPES buffered saline (HBS [pH 7.10]). Stable transfectants were selected using 

20 1000 ijg Zeocin®/ml in the growth medium (DMEM with 1000 mg glucose/I, 10 % foetal bo- 
vine serum (FBS), 100 mg penicillin-streptomycin mixture ml-1, 2 mM l-glutamine). The hM1 
receptor and PKCa-F64L-S65T-GFP fusion protein were maintained with 500 nM 
methotrexate and 500 |jg Zeocin®/ml respectively. 24 hours prior to any experiment, the 
cells were transferred to HAM F-12 medium with glutamax, 100 pg penicillin-streptomycin 

25 mixture ml and 0.3 % FBS. This medium relieves selection pressure, gives a low induction 
of signal transduction oathwav?; nnd hn<^ n 'ow nv*^-'^' •^^n^^^^'-r. - — - 



'v'lonitonng tne PKG activity in real time: 

Digital images of live cells were gathered using a Zeiss Axiovert 135M fluorescence micro- 
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CH250 charged coupled device (CCD) camera. The cells were illuminated with a 100 W arc 
lamp. In the light path was a 470±20 nm excitation filter, a 510 nm dichroic mirror and a 
515±15 nm emission filter for minimal image background. The cells were kept and monitored 
to be at 37*'C with a custom built stage heater. 

5 Images were analyzed using the IPLab software package for Macintosh. 

Upon stimulation of the M1-BHK cells, stably expressing the PKCa-F64L-S65T-GFP fusion, 
with carbachol we observed a dose-dependent transient translocation from the cytoplasm to 
the piasma membrane (Fig. 7a,b,c). Simultaneous measurement of the cytosolic free cal- 
cium concentration shows that the carbachol-induced calcium mobilisation precedes the 
10 translocation (Fig. 8). 

Stepwise procedure for quantitation of translocation of PKC: 

1 . The image was corrected for dari^ current by performing a pixel-by-pixel subtraction of a 
dark image (an image taken under the same conditions as the actual image, except the 
camera shutter is not allowed to open). 

15 2. The image was corrected for non-uniformity of the illumination by performing a pixel-by- 
pixel ratio with a flat field correction image (an image taken under the same conditions as the 
actual image of a uniformly fluorescent specimen). 

3. A copy of the image was made in which the edges are identified. The edges in the image 
are found by a standard edge-detection procedure - convolving the image with a kerne! 

20 which removes any targe-scale unchanging components (i.e., background) and accentuates 
any small-scale changes (i.e., sharp edges). This image was then converted to a binary im- 
age by threshholding. Objects in the binary image which are too small to represent the 
edges of cells were discarded, A dilation of the binary image was performed to close any 
gaps in the image edges. Any edge objects in the image which were in contact with the bor- 

25 ders of the image are discarded. This binary image represents the edge mask. 

4. Another copy of image was made via the procedure in step 3, This conv v/rs fc-tho- 

i .',e original i^Tiage was nnasKed with tne edge mask from step 3 and the sum total of all 
30 pixel values is determined. 
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6. The original image was masked with the whole cell mask from step 4 and the sum total 
of all pixel values was determined. 

7. The value from step 5 was divided by the value from step 6 to give the final result, the 
fraction of fluorescence intensity in the cells which was localized in the edges. 

5 

EXAMPLE 3 

Probes for detection of mitogen activated protein kinase Erk1 redistribution. 

Useful for monitoring signalling pathways involving MAPK. e.g. to identify compounds which 
10 modulate the activity of the pathway in living cells. 

Erk1 , a serine/threonine protein kinase, is a component of a signalling pathway which is acti- 
vated by e.g. many growth factors. 

Probes for detection of ERK-1 activity in real time within living cells: 

15 The extracellular signal regulated kinase (ERK-1 , a mitogen activated protein kinase. MAPK) 
is fused N- or C-terminaliy to a derivative of GFP. The resulting fusions expressed in differ- 
ent mammalian cells are used for monitoring in vivo the nuclear translocation, and thereby 
the activation, of ERK1 in response to stimuli that activate the MAPK pathway. 

a) Construction of munne ERK1 - F64L-S65T-GFP fusion: 

20 Convenient restriction endonuclease sites are introduced into the cDNAs encoding murine 
ERK1 (GenBank Accession number: Z14249) and F64L-S65T-GFP (sequence disclosed in 
WO 97/11 094) by polymerase chain reaction (PGR). The PGR reactions are performed ac- 
cording to standard protocols with the following primers: 
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3'ERK1 : gTCATCTTCTCgAgTCTTTCAggCgCgCCCggggCCCTCTggCgCCCCTggCTgg 
(SEQ ID NO;8), 

5'F64L-S65T-GFP: TTggACACAAgCTTTggACACggCgCgCCATgAgTAAAggAgAAgAACTT- 
TTC (SEQ IDN0:1) 

5 3'F64L-S65T-GFP: gTCATCTTCTCgAgTCTTACTCCTgAggTTTgTATAgTTCATCCATgC- 
CATgT (SEQ ID NO:2) 

To generate the mERK1-F64L-S65T-GFP (SEQ ID NO:56 & 57) fusion the ERK1 amplifica- 
tion product is digested with Hindlll+AscI and the F64L-S65T-GFP product with Ascl+Xhol. 
To generate the F64L-S65T-GFP-mERK1 fusion the ERK1 amplification product is then di- 
10 gested with Hindlll+Bsu36l and the F64L-S65T-GFP product with Bsu36l+Xhol.The two 
pairs of digested PGR products are subsequently ligated with a Hindlll+Xhoi digested plas- 
mid (pZeoSV® mammalian expression vector, Invitrogen, San Diego, OA, USA). The resul- 
ting fusion constructs are under control of the SV40 promoter. 

15 b) The human Erk1 gene (QenBank Accession number: X60188) was amplified using PGR 
according to standard protocols with primers Erk1-top (SEQ ID N0:9) and Erk1- 
bottom/+stop (SEQ ID NO: 10) . The PGR product was digested with restriction enzymes E- 
coRI and BamH1, and ligated into pEGFP-G1 (Clontech, Palo Alto; GenBank Accession 
number U55763) digested with EcoR1 and BamH1. This produces an EGFP-Erk1 fusion 

20 (SEQ ID NO:38 &39) under the control of a CMV promoter. 

The plasmid containing the EGFP-Erki fusion was transfected into HEK293 cells employing 

the FUGENE transfection reagent (Boehhnger Mannheim). Prior to experiments the ceils 

were grown to 80%-90% confluency 8 well chambers in DMEM with 10% FCS. The ceils we- 

25 re washed in plain HAM F-12 medium (without FCS), and then incubated for 30-60 minutes 

in plain HAM F-12 (without FCS) with 100 micromolar PD98059, an inhibitor of MEK1 , a ki- 
nase which activates Erk1 : this ^^pp pffn-t- r.' ' 

...y. ,: , control experiments), tne inhibitor is included in the Hepes buffer. 
The experimental setup of the microscope was as described in eynmp*^ " 
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60 images were collected with 10 seconds between each, and with the test compound added 
after image number 10. 

Addition of EGF (1-100 nM) caused within minutes a redistribution of EGFP-Erkl from the 
5 cytoplasm into the nucleus (Fig. 9a, b). 

The response was quantitated as described below and a dose-dependent relationship bet- 
ween EGF concentration and nuclear translocation of EGFP-Erkl was found (Fig. 9c,d). Re- 
dltribution of GFP fluorescence is expressed in this example as the change in the ratio value 
between areas in nuclear versus cytoplasmic compartments of the cell. Each time profile is 
10 the average of nuclear to cytoplasmic ratios from six ceiis in each treatment. 

EXAMPLE 4: 

Probes for detection of Erk2 redistribution. 

Useful for monitoring signalling pathways involving MAPK. e.g. to identify compounds which 
1 5 modulate the activity of the pathway in living cells. 

Erk2, a serine/threonine protein kinase, is closely related to Erki but not identical; it is a 
component of a signalling pathway which is activated by e.g. many growth factors, 

a) The rat Erk2 gene (GenBank Accession number: M64300) was amplified using PGR ac- 
20 cording to standard protocols with primers Erk2-top (SEQ ID NO: 11) and Erk2-bottom/+stop 
(SEQ ID NO: 13) The PGR product was digested with restriction enzymes Xhol and BamH1, 
and ligated into pEGFP-CI (Clontech, Palo Alto; GenBank Accession number U55763) dige- 
sted with Xho1 and BamHI. This produces an EGFP-Erk2 fusion (SEQ ID NO:40 &41) under 
the control of a CMV promoter. 

25 b) The rat Erk2 gene (GenBank Accession number: M64300) was amplified using PGR ac- 
cording to standard protocols vvit^^ --r-r- '^r^ - ' 

- :■ /r^ei/ . -:^)MiK".: I -t^nbanK ACce5Siot , nunner U55762j dige- 

oiea with Xho1 and BamHI . This produces an Erk2-EGFP fusion (SEQ ID NO:58 &59) under 
30 the control of a CMV promoter. 
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The resulting ptasmids were transfected into CHO celts and BHK cells. The cells were grown 
under standard conditions. Prior to experiments, the cells were starved in nriedium without 
serunn for 48-72 hours. This led to a predominantly cytoplasmic localization of both probes, 
5 especially in BHK cells. 10% fetal calf serum was added to the cells and the fluorescence of 
the cells was recorded as explained in example 3. Addition of serum caused the probes to 
redistribute into the nucleus within minutes of addition of serum. 

EXAMPLE 5: 

10 Probes for detection of Smad2 redistribution. 

Useful for monitoring signalling pathways activated by some members of the transforming 
growth factor-beta family, e.g. to identify compounds which modulate the activity of the pa- 
thway in living cells. 

Smad 2, a signal transducer, is a component of a signalling pathway which is induced by 
1 5 some members of the TGFbeta family of cytokines. 

a) The human Smad2 gene (GenBank Accession number: AF027964) was amplified using 
PGR according to standard protocols with primers Smad2-top (SEQ ID NO:24) and Smad2- 
bottom/+stop (SEQ ID NO:26) . The PGR product was digested with restriction enzymes E- 

20 coR1 and Acc65i, and ligated into pEGFP-C1 (Clontech; Palo Alto; GenBank Accession 
number U55763) digested with EcoR1 and Acc65t. This produces an EGFP-Smad2 fusion 
(SEQ ID NO:50&51) under the control of a CMV promoter. 

b) The human Smad2 gene (GenBank Accession number: AF027964) was amplified using 
PGR according to standard protocols with pnmers Smad2-top (SEQ ID NO:24) and Smad2- 

25 bottom/-5top (SEQ ID NO:25) . The PGR product was digested with restriction enzymes E- 

coR1 and Acc651, and ligated into pEGFP-N1 (Clontech, Palo Alto; GenBank A^rorrir^^ 
number U5576?> di^neto-^^ . - - 



3C The plasmid containing the EGFP-Smad2 fusion was transfected into HEK293 ce'l? - i-o^r-. - 
showed a cvtoDla?n-'r ^ir*"i^^i:f :r- ^r, n v-r. — r - *^- .. ... ■ .*..■ ,■ 
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Chambers in DMEM with 10% FCS to 80% confluency and starved overnight in HAM F-12 
nnedium without FCS. 

For experiments, the HAM F-12 medium was replaced with Hepes buffer pH 7.2. 

The experimental setup of the microscope was as described in example 1. 

5 90 images were collected with 10 seconds between each, and with the test compound added 
after Image number 5. 

After serum starvation of cells, each nucleus contains less GFP fluorescence than the sur- 
rounding cytoplasm (Fig. 10a). Addition of TGFbeta caused within minutes a redistribution of 
10 EGFP-Smad2 from the cytoplasma into the nucleus (Fig. 10b). 

The redistribution of fluorescence within the treated cells was quantified simply as the fracti- 
onal increase in nuclear fluorescence normalised to the starting value of GFP fluorescence 
in the nucleus of each unstimulated cell. 

15 EXAMPLES: 

Probe for detection of VASP redistribution. 

Useful for monitoring signalling pathways involving rearrangement of cytoskeletal elements, 
e.g. to identify compounds which modulate the activity of the pathway in living cells. 

VASP, a phosphoprotein, is a component of cytoskeletal structures, which redistributes in 
20 response to signals which affect focal adhesions. 

a) The human VASP gene (GenBank Accession number: Z46389) was amplified using PGR 
according to standard protocols with primers VASP-top (SEQ ID NO:94) and VASP- 
bottom/+stop (SEQ ID NO:95). The PGR product was digested with restriction enzymes 
Hind3 and BamH1, and ligated into pEGFP-CI (Clontech, Palo Alto; GenBank Accession 
25 number U55763) digested with HindSand BamHI. This produces an EGFP-VAqn r,,^;,.. 
(SEQ ID NO-124 K^?"^' 

,:-a tr^o caicii !n--rncGcna:t^ iransTection method. Prior to experiments, ceils were grown in 
8 well Nunc chambers and starved overnight in medium without FCS. 
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Experiments are performed in a microscope setup as described in example 1. 

10% FCS was added to the cells and images were collected. The EGFP-VASP fusion was 
redistributed from a somewhat even distribution near the periphery into more localized 
structures, identified as focal adhesion points (Fig. 11). 

5 

A large number of further GFP fusions have been made or are in the process of being made, 
as apparent from the following Examples 7-22 which also suggest suitable host cells and 
substances for activation of the cellular signalling pathways to be monitored and analyzed. 

10 



EXAMPLE 7: 

Probe for detection of actin redistribution, 

15 Useful for monitoring signalling pathways involving rearrangement or formation of actin fila- 
ments, e.g. to identify compounds which modulate the activity of pathways leading to cyto- 
skeletal rearrangements in living ceils. 

Actin is a component of cytoskeletal structures, which redistributes in response to very many 
cellular signals. 

20 

The actin binding domain of the human alpha-actinin gene (GenBank Accession number: 
XI 5804) was amplified using PGR according to standard protocols with primers ABD-top 
(SEQ ID NO:90) and ABD-bottomAstop (SEQ ID N0:91). The PGR product was digested 
with restriction enzymes Hind3 and BamHI, and ligated into pEGFP-N1 (Clontech, Palo Alto; 
25 GenBank Accession number U55762) digested with Hind3 and BannM^ t^'- ' 

' c rosuijr;g piasmid was transfected into CHO cells expressing the human insulin receptor. 
Cells were stimulated with insulin which caused the actin binding domain-EGFP orobe tr, 
30 come redistributed into ^--------i -^r^*y-'-y . - a:*:' - - ^/l: 
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Example 8: 

Probes for detection of p38 redistribution. 

5 Useful for monitoring signalling pathways responding to various cellular stress situations, 
e.g. to identify compounds which modulate the activity of the pathway in living cells, or as a 
counterscreen. 

p38, a serine/thronine protein kinase, is a component of a stress-induced signalling pathway 
which is activated by many types of cellular stress, e.g. TNFalpha, anisomycin, UV and mi- 
10 tomycin C. 

a) The human p38 gene (GenBank Accession number: L35253) was amplified using PGR 
according to standard protocols with primers p38-top (SEQ ID NO: 14) and p38- 
bottom/+stop (SEQ ID NO: 16). The PGR product was digested with restriction enzymes 

15 Xho1 and BamHi, and ligated into pEGFP-C1 (Clontech, Palo Alto; GenBank Accession 

number U55763) digested with Xho1 and BamHi. This produced an EGFP-p38 fusion (SEQ 
ID NO:46 &47) under the control of a GMV promoter. 

b) The human p38 gene (GenBank Accession number: L35253) was amplified using PGR 
according to standard protocols with primers p38-top (SEQ ID NO:13) and p38-bottom/-stop 

20 (SEQ ID NO: 15) . The PGR product was digested with restriction enzymes Xho1 and 
BamHi, and ligated into pEGFP-N1 (Clontech, Palo Alto; GenBank Accession number 
U55762) digested with Xhol and BamHi This produced a p38-EGFP fusion (SEQ ID NO:64 
&65) under the control of a GMV promoter. 

The resulting plasmids are transfected into a suitable cell line, e.g. HEK293, in which the 

25 EGFP-p38 probe and/or the p38-EGFP probe should change its cellular distribution from 

predominantly cytoplasmic to nuclear within minutes in response to activnti^- 
line rfR^hwnv v'-'^^*^ - ■ 

30 Probes for detection of Jnki redistribution. 
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Useful for monitoring signalling pathways responding to various cellular stress situations, 
e.g. to identify compounds which modulate the activity of the pathway in living cells, or as a 
counterscreen. 

Jnk1, a serine/threonine protein kinase, is a component of a stress-induced signalling pa- 
5 thway different from the p38 described above, though it also is activated by many types of 
cellular stress, e.g. TNFalpha. anisomycin and UV. 

a) The human Jnk1 gene (GenBank Accession number: L26318) was amplified using PGR 
according to standard protocols with primers Jnk-top (SEQ ID NO: 17) and Jnk-bottom/+stop 

10 (SEQ ID N0:19) . The PGR product was digested with restriction enzymes Xhol and 
BamH1, and ligated into pEGFP-C1 (Clontech, Palo Alto; GenBank Accession number 
U55763) digested with Xho1 and BamH1. This produced an EGFP-Jnk1 fusion (SEQ ID 
NO:44 &45) under the control of a CMV promoter, 

b) The human Jnki gene (GenBank Accession number: L26318) was amplified using PGR 
15 according to standard protocols with primers Jnk-top (SEQ ID NO: 17) and Jnk-bottom/-stop 

(SEQ ID NO:18) . The PGR product was digested with restriction enzymes Xhol and 
BamH1, and ligated into pEGFP-N1 (Glontech, Palo Alto; GenBank Accession number 
U55762) digested with Xho1 and BamHl This produced a Jnkl-EGFP fusion (SEQ ID 
NO:62 &63) under the control of a CMV promoter. 

20 The resulting ptasmids are transfected into a suitable cell line, e.g. HEK293, in which the 
EGFP-Jnk1 probe and/or the Jnk1-EGFP probe should change its cellular distribution from 
predominantly cytoplasmic to nuclear in response to activation of the signalling pathway with 
e.g. anisomycin. 

25 

Example 10: 

Probes for detectinn nf own ^ - * 

:::^::^cunc:s - n!,--^ '-zz^,c^:c ine activity of the oainway in living cells. 

3G PGK, a cGMP-dependent serine/threonine protein kinase, mediates the auanvlv'- 
cyclase/cGMP siqna! 
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a) The human PKG gene (GenBank Accession number: Y07512) is amplified using PGR ac- 
cording to standard protocols with primers PKG-top (SEQ ID N0:81 ) and PKG-bottom/+stop 
(SEQ ID NO:83) . The PGR product is digested with restriction enzymes Xho1 and BamH1, 

5 and ligated into pEGFP-GI (Clontech, Palo Alto; GenBank Accession number U55763) dige- 
sted with Xho1 and BamH1. This produces an EGFP-PKG fusion (SEQ ID NO; 134 &135) 
under the control of a CMV promoter. 

b) The human PKG gene (GenBank Accession number: Y07512) is amplified using PGR ac- 
cording to standard protocols with primers PKG-top (SEQ ID NO:81) and PKG-bottom/-stop 

10 (SEQ ID NO: 82) . The PGR product is digested with restriction enzymes Xho1 and BamH1, 
and ligated into pEGFP-NI (Clontech, Palo Alto; GenBank Accession number U55762) dige- 
sted with Xho1 and BamH1. This produces a PKG-EGFP fusion (SEQ ID NO:136 &137) un- 
der the control of a CMV promoter. 

The resulting plasmids are transfected into a suitable ceil line, e.g. A10, in which the EGFP- 
15 PKG probe and/or the PKG-EGFP probe should change its cellular distribution from Cy1o- 
plasmic to one associated with cytoskeietal elements within minutes in response to treatment 
with agents which raise nitric oxide (NO) levels. 

Example 11: 

20 Probes for detection of IkappaB kinase redistribution. 

Useful for monitoring signalling pathways leading to NFkappaB activation, e.g. to identify 
compounds which modulate the activity of the pathway in living cells. 

IkappaB kinase, a serine/threonine kinase, is a component of a signalling pathway which is 
activated by a variety of inducers including cytokines, lymphokines, growth factors and 
25 stress. 

n) Thp r^^^y^^ - 

— ^•^■u.b^: am iKK>:.c::c;ii -step [SEQ ID NO:98). The PGR product is digested with 
0. restriction enzymes EcoRI and Acc65l, and ligated into pEGFP-C1 (Clontech, Palo Alto: 
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GenBank Accession number U55763) digested with EcoRland Acc651. This produces an 
EGFP-lkappaB-kinase fusion (SEQ ID NO:120 &121) under the control of a CMV promoter. 

b) The alpha subunit of the human IkappaB kinase gene (GenBank Accession number: 
AF009225) is amplified using PGR according to standard protocols with primers IKK-top 
5 (SEQ ID NO:96) and iKK-bottom/-stop (SEQ ID NO:97). The PGR product is digested with 
restriction enzymes EcoR1 and Acc65i, and ligated into pEGFP-N1 (Clontech, Palo Alto; 
GenBank Accession number U55762) digested with EcoR1 and Acc651. This produces an 
IkappaB-kinase-EGFP fusion (SEQ ID NO:122 &123) under the control of a CMV promoter. 

The resulting plasmids are transfected into a suitable cell line, e.g. Jurkat, in which the 
0 EGFP-lkappaB-kinase probe and/or the IkappaB-kinase-EGFP probe should achieve a more 
cytoplasmic distribution within seconds following stimulation with e.g. TNFalpha. 



Example 12: 

Probes for detection of CDK2 redistribution. 

15 Useful for monitoring signalling pathways of the cell cycle, e.g. to identify compounds which 
modulate the activity of the pathway in living cells. 

CDK2, a cyclin-dependent serine/threonine kinase, is a component of the signalling system 
which regulates the cell cycle. 



20 a) The human CDK2 gene (GenBank Accession number: X61622) is amplified using PGR 
according to standard protocols with primers CDK2-top (SEQ ID NO: 102) and CDK2- 
bottom/+stop (SEQ ID NO: 104). The PGR product is digested with restriction enzymes Xhol 
and BamH1 , and ligated into pEGFP-C1 (Clontech, Palo Alto; GenBank Accession number 
U55763) digested with Xhol and BamHI. This produces an EGFP-CDK2 fusion (SEQ ID 

25 NO: 114 &115) under the control of a CMV promoter. 

b) The human CDK2 gene (GenBank Accession numbe- YRi^?':^^ " 
accordino to ^tn^^-'- ■ 

f:aaten in*'- c^3--'-N: .L^iontech, Palo Alto; GenBank Accession number 

J55762) digested with Xho1 and BamHI . This produces a CDK2-EGFP fusion (SEQ ID 
N0:1 12 &113) under the control of a CMV prn^-t^- 
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The resulting plasmids are transfected into a suitable cell line. e.g. HEK293 in which the 
EGFP-CDK2 probe and/or the CDK2-EGFP probe should change its cellular distribution 
from cytoplasmic in contact-inhibited cells, to nuclear location in response to activation with a 
nunnber of growth factors, e.g. IGF. 

Example 13: 

Probes for detection of Grk5 redistribution. 

Useful for monitoring signalling pathways involving desensitization of G-protein coupled re- 
ceptors, e.g. to identify compounds which modulate the activity of the pathway in living cells. 

Grk5, a G-protein coupled receptor kinase, is a component of signalling pathways involving 
membrane bound G-protein coupled receptors. 



a) The human Grk5 gene (GenBank Accession number; L15388) is amplified using PGR ac- 
cording to standard protocols with primers Grk5-top (SEQ ID NO;27) and Grk5- 

15 bottom/+stop (SEQ ID NO:29). The PGR product is digested with restriction enzymes EcoR1 
and BamH1, and ligated into pEGFP-CI (Clontech, Palo Alto; GenBank Accession number 
U55763) digested with EcoR1 and BamH1. This produces an EGFP-Grk5 fusion (SEQ ID 
NO:42 &43) under the control of a CMV promoter. 

b) The human Grk5 gene (GenBank Accession number: L15388) is amplified using PGR ac- 
20 cording to standard protocols with primers Grk5-top (SEQ ID NO:27) and GrkS-bottomZ-stop 

(SEQ ID NO:28). The PGR product is digested with restriction enzymes EcoR1 and BamH1 , 
and ligated into pEGFP-N1 (Clontech, Palo Alto; GenBank Accession number U55762) dige- 
sted with EcoR1 and BamH1. This produces a Grk5-EGFP fusion (SEQ ID NO:60 &61) un- 
der the control of a CMV promoter. 

25 The resulting plasmids are transfected into a suitable cell line, e.g. HEK293 expressing a rat 

dopamine D1 A receptor, in which the EGFP-Grk5 probe and/or the Grk^^pr^t^"^ 

should change its celi'i'nr nir^r-,^ * 



30 Example 14: 
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Probes for detection of Zap70 redistribution. 

Useful for monitoring signalling pathways involving the T cell receptor, e.g. to identify com- 
pounds which modulate the activity of the pathway in living cells, 

Zap70, a tyrosine kinase, is a component of a signalling pathway which is active in e.g. T-cell 
5 differentiation. 

a) The human Zap70 gene (GenBank Accession number: L05148) is amplified using PGR 
according to standard protocols with primers Zap70-top (SEQ ID NO: 105) and Zap70- 
bottom/+stop (SEQ ID NO; 107). The PGR product is digested with restriction enzymes E- 

10 coRI and BamH1, and ligated into pEGFP-G1 (GenBank Accession number U55763) dige- 
sted with EcoRI and BamH1. This produces an EGFP-Zap70 fusion (SEQ ID NO:108 &109) 
under the control of a CMV promoter. 

b) The human Zap70 gene (GenBank Accession number: L05148) is amplified using PGR 
according to standard protocols with primers Zap70-top (SEQ ID NO: 105) and Zap70- 

15 bottom/-stop (SEQ ID NO: 106). The PGR product is digested with restriction enzymes E- 
coRI and BamHI, and ligated into pEGFP-NI (Glontech. Palo Alto; GenBank Accession 
number U55762) digested with EcoR1 and BamHI. This produces a Zap70-EGFP fusion 
(SEQ ID NO:1 10 &1 1 1) under the control of a CMV promoter. 

The resulting plasmids are transfected into a suitable cell line, e.g. Jurkat, in which the 
20 EGFP-Zap70 probe and/or the Zap70-EGFP probe should change its cellular distribution 
from cytoplasmic to membrane-associated within seconds in response to activation of the T 
cell receptor signalling pathway with e.g. antibodies to CD3epsilon. 

Example 15: 

25 Probes for detection of p85 redistribution. 

Useful for monitoring signalling pathways involvinq PI-3 Vapr^^ ^ ' 
which pnr^dM'n^f-^ 

■■■■^ — ^'^''^-'-^Hvunc: tyrosine kinase receptors and G-protein-coupted receptors. 

30 
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a) The human p85alpha gene (GenBank Accession number: M61906) was amplified using 
PGR according to standard protocols with primers p85-top-C (SEQ ID NO:22) and p85- 
bottom/+stop (SEQ ID NO:23) . The PGR product was digested with restnction enzymes 
BgI2 and BamH1. and ligated into pEGFP-G1 (Glontech. Palo Alto; GenBank Accession 

5 number U55763) digested with Bg!2 and BamH1 . This produced an EGFP-p85alpha fusion 
(SEQ ID NO:48 &49) under the control of a CMV promoter. 

b) The human p85alpha gene (GenBank Accession number: M61906) was amplified using 
PGR according to standard protoGois with primers p85-top-N (SEQ ID NO:20) and p85- 
bottom/-stop (SEQ ID N0:21) . The PGR product was digested with restriction enzymes E- 

10 coR1 and BamH1, and ligated into pEGFP-N1 (Glontech, Palo Alto; GenBank Accession 
number U55762) digested with EcoR1 and BamH1. This produced a p85alpha-EGFP fusion 
(SEQ ID NO:66 &67) under the control of a CMV promoter. 

The resulting plasmids are transfected into a suitable cell line, e.g. CHO expressing the hu- 
man insulin receptor, in which the EGFP-p85 probe and/or the p85-EGFP probe may change 
15 its cellular distribution from cytoplasmic to membrane-associated within minutes in response 
to activation of the receptor with insulin. 

Example 16: 

Probes for detection of protein-tyrosine phosphatase redistribution. 

20 Useful for monitoring signalling pathways involving tyrosine kinases, e.g. to identify com- 
pounds which modulate the activity of the pathway in living cells. 

Protein-tyrosine phosphataseIC, a tyrosine-specific phosphatase, is an inhibitory compo- 
nent in signalling pathways involving e.g. some growth factors. 

25 a) The human protein-tyrosine phosphatase 1C gene (GenBank Accession number: X62055) 
Is amplified using PGR according to standard protocols with primers PTP-tnn ir- 
NO:99)and PTP-bntt^n^^^ - ' 

' .^cnsTea x:"^ , arc tcoK ; . This proauces an EGFP-PTP 

jsicn (SEQ ID N0:116 8c117) under the control of a CMV promoter. 
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-°-^) and PrP..o»c^.s,op LT^TTT ^'"^ ^""-"^ » 

' "umber .55762, digested w* Z Z7 *^ 

<SBO ,0 NO:„a under th "^'^ ™= — = PTP....p 

T^-esu,.tngp,as.i,sate,rans,ec,ed, ^''*'^^°™"'- 
E«^P-PTP proba and/or j!. 1"'° = « -9. MCP-7 ,„ 

cytoptas. to the plas™ msnb^neX™^ «"'-«bu.o„;™ 
'« 'hNbitory s,gna,„„g pa„^, ^ ^ -ponse to activabon ofthe gro«h 

Example 17; 

frobea tor detection of Smad4redisb,bu«on 

tf^way in living cells. -'"^-"^^ ^^i*ch modulate the activity of ttie pa- 

Smad4, a signal transducer is a com 

^ ™n,be. 0, the TGPbeta ,=,0,,^ °' '^'^-'^ va- 



20 



25 



a) The human Smad4 gene /Gphr a 

^CR according to standard rotoc„r«r"'°" ™^ -ing 

(3E0 ,D NO:35, . The PCP ^0^^ 0, 7 ^™«-^»--s.o 
=»H,. and .gated into pEGFP c 1 1?? ~ ^"^'"^ ^""^ -d 
"«763, digested * HcoR, and b J„ " i ° *--ion „„.,er 

«3, under the control „, a CMV prorir'"'" " '"''"'"^^ '° 



b) The human Smad4 qene rr»nD , . 
PCR according ,- J""^^* ""-"en U443™, 



-mber U55762, digested «tn EcoRra^p ' '*=«3s,on 
'0 -7e under cot,:: r.:" = — - -on 
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The resulting plasmids are transfected into a cell line, e.g. HEK293 in which the EGFP- 
Smad4 probe and/or the Smad4-EGFP probe should change its cellular distribution within 
nninutes fronn cytoplasmic to nuclear in response to activation of the signalling pathway with 
e.g. TGFbeta. 

5 

Example 18: 

Probes for detection of StatS redistribution. 

Useful for monitoring signalling pathways involving the activation of tyrosine kinases of the 
Jak family, e.g. to identify compounds which modulate the activity of the pathway in living 
10 cells. 

Stats, signal transducer and activator of transcription, is a component of signalling pathways 
which are induced by e.g. many cytokines and growth factors. 

a) The human StatS gene (GenBank Accession number: L41142) was amplified using PGR 
15 according to standard protocols with primers Stat5-top (SEQ ID NO;30) and StatS- 

bottom/+stop (SEQ ID NO:32). The PGR product was digested with restriction enzymes Bgl2 
and Acc65l, and ligated into pEGFP-C1 (Clontech; Palo Alto; GenBank Accession number 
US5763) digested with Bgl2 and Acc65l. This produced an EGFP-StatS fusion (SEQ ID 
NO;S4 &S5) under the control of a CMV promoter. 

20 b) The human StatS gene (GenBank Accession number: L41 142) was amplified using PGR 
according to standard protocols with primers StatS-top (SEQ ID NO:30) and StatS-bottom/- 
stop (SEQ ID NO:331). The PGR product was digested with restriction enzymes Bgl2 and 
Acc6Sl, and ligated into pEGFP-N1 (Clontech, Palo Alto; GenBank Accession number 
U5S762) digested with Bgl2 and Acc65L This produced a StatS-EGFP fusion (SEQ ID NO:78 

25 &79) under the control of a CMV promoter. 

The resulting plasmids are transfected into a suitable cell line, e a MINR in '.vh-^- - - - 
Stats probe nnd'^'- c*^.ir ---^ 
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Example 19: 

Probes for detection of NFAT redistribution. 

Useful for nnonitoring signalling pathways involving activation of NFAT, e.g. to identify com- 
pounds which modulate the activity of the pathway in living cells. 

5 NFAT, an activator of transcription, is a component of signalling pathways which is involved 
in e.g. immune responses. 

a) The human NFAT1 gene (GenBank Accession number: U43342) is amplified using PGR 
according to standard protocols with primers NFAT-top (SEQ ID NO:84) and NFAT- 

10 bottom/+stop (SEQ ID NO:86). The PGR product is digested with restriction enzymes Xho1 
and EcoRI . and iigated into pEGFP-G1 (Clontech, Paio Alto; GenBank Accession number 
U55763) digested with Xho1 and EcoRl This produces an EGFP-NFAT fusion (SEQ ID 
NO: 130 &131) under the control of a GMV promoter. 

b) The human NFAT gene (GenBank Accession number: U43342) is amplified using .PGR 

15 according to standard protocols with primers NFAT-top (SEQ ID NO:84) and NFAT-bottom/- 
stop (SEQ ID NO:85), The PGR product is digested with restriction enzymes Xho1 and E- 
coR1. and Iigated into pEGFP-N1 (Glontech, Palo Alto; GenBank Accession number 
U55762) digested with Xho1 and EcoRI. This produces an NFAT-EGFP fusion (SEQ ID 
NO:132 &133) under the control of a GMV promoter. 

20 The resulting plasmids are transfected into a suitable cell line, e.g. Jurkat, in which the 
EGFP-NFAT probe and/or the NFAT-EGFP probe should change its cellular distribution 
from cytoplasmic to nuclear within minutes in response to activation of the signalling pathway 
with e.g. antibodies to GD3epsiton. 

25 Example 20: 

Probes for detection of NFkappaB redistribution 
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NFkappaB, an activator of transcription, is a connponent of signalling pathways which are 
responsive to a verity of inducers including cytokines, lymphokines, some immunosuppress 
ve agents, 

5 a) The human NFkappaB p65 subunit gene (GenBank Accession number: M62399) Is am- 
plified using PGR according to standard protocols with primers NFkappaB-top (SEQ ID 
NO:87) and NFkappaB-bottom/+stop (SEQ ID NO:89). The PCR product is digested with 
restriction enzymes Xhol and BamH1, and ligated into pEGFP-C1 (Clontech, Palo Alto; 
GenBank Accession number U55763) digested with Xho1 and BamH1. This produces an 
10 EGFP-NFkappaB fusion (SEQ ID NO: 142 & 143) under the control of a CMV promoter 

b) The human NFkappaB p65 subunit gene (GenBank Accession number: M62399) is am- 
plified using PCR according to standard protocols with primers NFkappaB-top (SEQ ID 
NO:87) and NFkappaB-bottom/-stop (SEQ ID NO:88). The PCR product is digested with re- 
striction enzymes Xhol and BamH1, and ligated into pEGFP-N1 (Clontech, Palo Alto; Gen- 
1 5 Bank Accession number U55762) digested with Xho1 and BamH1 . This produces an 
NFkappaB-EGFP fusion (SEQ ID NO:140 & 141) under the control of a CMV promoter. 

The resulting plasmids are transfected into a suitable cell line, e.g. Jurkat, in which the 
EGFP-NFkappaB probe and/or the NFkappaB-EGFP probe should change its cellular distri- 
bution from cytoplasmic to nuclear in response to activation of the signalling pathway with 
20 e.g. TNFalpha. 

Example 21: 

Probe for detection of RhoA redistribution. 

Useful for monitoring signalling pathways involving RhoA, e.g. to identify compounds which 
25 modulate the activity of the pathway in living cells. 

RhoA, a small GTPase, is a component of many signallinq oathwi^v*^ ^ i p a . 
skeletal re^irrrt'^^n^-^'^* 

* :iu:nar^ KhoA gene (GenBank Accession number: L25080) was amplified using PCR 
30 according to standard protocols with primers RhoA-top (SEQ ID a^d Rhc- 

bottom/+ston fSEQ ID '^JO P" ■ ^^^'^ ^^^R 
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Hind3 and BamH1, and ligated into pEGFP-C1 (Clontech. Palo Alto; GenBank Accession 
number U55763) digested with Hind3and BamHI. This produced an EGFP-RhoA fusion 
(SEQ ID NO: 126 &127) under the control of a CMV promoter. 

The resulting plasmid is transfected into a suitable cell line, e.g. Swiss3T3, in which the 
5 EGFP-RhoA probe should change its cellular distribution from a reasonably homogenous to 
a peripheral distribution within minutes of activation of the signalling pathway with e.g. LPA. 

Example 22: 

Probes for detection of PKB redistribution. 

Useful for monitoring signalling pathways involving PKB e.g. to identify compounds which 
10 modulate the activity of the pathway in living cells. 

PKB, a serine/threonine kinase, is a component in various signalling pathways, many of 
which are activated by growth factors. 

a) The human PKB gene (GenBank Accession number: M63167) is amplified using PGR ac- 
15 cording to standard protocols with primers PKB-top (SEQ ID NO:36) and PKB-bottom/+stop 
(SEQ ID NO:80) . The PGR product is digested with restriction enzymes Xho1 and BamHI, 
and ligated into pEGFP-C1 (Clontech, Palo Alto; GenBank Accession number U55763) dige- 
sted with Xhol and BamHI. This produces an EGFP-PKB fusion (SEQ ID NO:138 & 139) 
under the control of a CMV promoter. 

20 b) The human PKB gene (GenBank Accession number: M63167) was amplified using PGR 
according to standard protocols with primers PKB-top (SEQ ID NO:36) and PKB-bottom/- 
stop (SEQ ID NO:37) . The PGR product was digested with restriction enzymes Xho1 and 
BamHI, and ligated into pEGFP-NI (Clontech, Palo Alto; GenBank Accession number 
U55762) digested with Xhol and BamHI. This produced a PKB-EGFP fusion (SEQ ID 

25 NO:70 &71) under the control of a CMV promoter. 

The resulting plasmids are transfected into a suitable cell line, e a CHO evprpc^r r - 

■ .t _ i.'^TiliC. ' ^' ''^^>0^:Ji:■ 

^ /. J c^JO;:icn r -^sui:- - ■-■ :-ansii!wn sdcu^g oe apparent within minutes. 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 



PCT/DK98/00145 



56 

REFERENCES: 

Adams, S.R., Harootunian, AT., Buechler, YJ., Taylor, S.S. & Tsien. R.Y. (1991) Nature 
349, 694-697 

5 Biobe, G.C., Stribltng, D.S., Fabbro, D,, Stabel, S & Hannun, Y.A. (1996) J, Biol. Chem. 271, 
15823-15830 

Chalfie, M., Tu. Y., Euskirchen, G., Vvdrd, W.W. & Prasher, D.C. (1994) Science 263, 802- 
805 

Cossette, LJ., Hoglinger, O., Mou, LJ. & Shen, S,H. (1997) Exp. Cell Res. 223, 459-466 
10 DeBernardi, M.A. & Brooker, G. (1996) Proc. Natl. Acad. Sci. USA 93, 4577-4582 
Farese, R.V.. (1992) Biochem. J, 288, 319-323 

Fulop Jr., T., Leblanc, C, Lacombe, G. & Dupuis, G. (1995) FEBS Lett. 375. 69-74 

Godson, Masliah, E., Balboa, MA, Ellisman, M.H. & Insel, P.A. (1996) Biochem. Bio- 
phys. Acta 1313, 63-71 

15 Khaiil, R.A., Lajoie, C, Resnick, M.S. & Morgan, K.G. (1992) American Physiol. Society c, 
714-719 

Sano, M., Kohno, M. & Iwanaga, M, (1995) Brain Res. 688, 213-218 

Basttaens, P.I.H. & Jovin, T.M. (1996) Proc. Natl. Acad. Sci. USA 93, 8407-8412 

Schmidt, D.J., ikebe, M., Kitamura, K.. & Fay. F.S. (1997) FASEB J. 11, 2924 (Abstract) 

20 Sakai. N., Sasaki. K., Hasegawa, C, Ohkura. M., Suminka, K., Shirai, Y. & Saito, N. (1996) 
Soc. Neuroscience 22, 69P (Abstract) 

Sakai, N., Sakai, K. Hasegawa, C, Ohkura, M., Sumioka, ., Shirai, Y., & Naoaki. S. (1997) 
Japanese Journal of Pharmacology 73, 69P (Abstract of a meeting held 22-23 March) 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 



57 



PCT/DK98/00145 



10 



15 



20 



25 



30 



35 



40 



45 



SEQUENCE LISTING 

(1) GENERAL INFORMATION 

fii) TITLE OF THE INVENTION . 

Fluorescense in,aging " "''^ ^"^^^^ Polypeptides Using 
(iii) NUMBER OF SEQUENCES: 

(iy) CORRESPONDENCE ADDRES<; 
(A) ADDRESSRP. m ^"^^S: 

(D) STATE: DK 

(E) COUNTRY: DENMARK 
<F) ZIP: 2860 

(V) COMPUTER READABLE FORM- 
A MEDIU^, TYPE: Diskette 
(B) COMPUTER- IBM n„ 

OPERATINO sStem"',^;^ 
SOFTWARE: FastSEO T.l 

...uuows Version 2.0 

~-N: 

(B) REGISTRATION NUMBER - 

(C) REFERENCE/DOCKET NUMBER: 

(2) INFORMATION FOR SeQ Id no : I : 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 5 3 baso ' ' 

(B) TYDc- pairs 
B TYPE: nucleic acid 

C STRANDEDMESS : single 
W TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTinw or. 

'-KIPTION: SEQ ID NO - 1 • 

^TOGACACAA GCTTTGGACA CGGCGCGCCA TGAO. 

^GCCA TGAGTAAAGG AGAAGAACTT TTr 



' -PH : nucleic 



'n^ --^xeic acid 

55 SxRANDED^ESS: siTnr, 



fD) TOPOLOGY Ti"" """"^^^ 



inear 



SUBSTITUTE SHEFT (RULE 26) 



wo 98/45704 PCT/DK98/001 45 

58 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
GTCATCTTCT CGAGTCTTAC TCCTGAGGTT TGTATAGTTC ATCCATGCCA TGT 5 3 

5 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 54 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

TTGGACACAA GCTTTGGACA CCCTCAGGAT ATGGGCAACG CCGCCGCCGC CAAG 54 
(2) INFORMATION FOR SEQ ID NO : 4 : 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

30 GTCATCTTCT CGAGTCTTTC AGGCGCGCCC AAACTCAGTA AACTCCTTGC CACAC 55 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

TTGGACACAA GCTTTGGACA CCCTCAGGAT ATGGCTGACG TTTACCCGGC CAACG 5 5 

45 (2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 5 base T:a:r- 
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;xi; SEQUENCE CE3CRIFTI0N: SEQ ID NO : 6 
GTCATCTTCT CGAGTCTT^"^ ...... . .. 
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(2) INFORMATION POR SEQ ID ^o.-V. 

5 J SEQUENCE CHARACTERISTICS 

(A) LENGTH: 64 base r^f 

(B) TVDo ; ^ pairs 
a TYPE: nucleic acid 

C) STRANDEDNESS • sSal 

TOPOLOGY: linear ' 

) 

GGGG 

GCTTTGGACA CCCTCAGGAT ATrrn.. 

---^^CGG CGGCGGCGGC TCCGGGGGGC ^0 

INFORMATION POR SEQ ID NO- 8- 
";ArSS.~HISTICS: 

»> TOPOLOGY: U,=."°''= 

™«TC™. -^-CTTT. .OOCCOCCO : ^ 

^o, ' °«CGCCCCTG GCTGr 

^-^OH«A™roRSBQlDNO:S: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH- 3n ""-^""^ 

(B) TYPE: nucLr" ''f " 
C) STRANDEDNESS: sinnl 

(D) TOPOLOGY: linear ' 

DESCRIPTION: SKQ XO NO • . • 
-AOAATTCAA CCATGGCGGC GGCOGCGGCG 

(2) ^"FO^TIONPORSEQIDNO:IO: 

'i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 29 bas» o ' 

(B) TYPE- ^, n P^^^S 
in til acid 

(C) STRANDEDNESS- sir.r,^ 
TOPOLOGY: l.nea" 



"^'FORMATION FOR SEO I- 



10 NO: 11 
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(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 11 : 

10 TACTCGAGTA ACCATGGCGG CGGCGGCGGC G 

(2) INFORMATION FOR SEQ ID NO : 12 : 

{i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TAGGATCCAT AGATCTGTAT CCTGG 
25 (2) INFORMATION FOR SEQ ID N0:13: 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 

35 

TAGGATCCTT AAGATCTGTA TCCTGG 

(2) INFORMATION FOR SEQ ID NO: 14: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



: r:QUE:;cE characteristics 

(A) :.ENGTH: 28 base pairs 
55 (3) TYPE: nucleic acid 
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(D) TOPOLOGY: linear 
Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:15- 

5 

ATGGATCCTC GGACTCCATC TCTTCTTG 2 8 

(2) INFORMATION FOR SEQ ID NO: 16: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNFRS : cinglt; 

(D) TOPOLOGY: linear 



15 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
ATGGATCCTC AGGACTCCAT CTCTTCTTG 2 9 

(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 8 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

GTCTCGAGCC ATCATGAGCA GAAGCAAG 2 8 

(2) INFORMATION FOR SEQ ID NO: 18: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
45 GTGGATCCCA CTGCTGCACC TGTGCTA 2 7 

(2) INFORMATION FOR SEQ ID NO: 19: 



'GPOLCGY: linear 
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GTGGATCCTC ACTGCTGCAC CTGTGCTA 2 8 

(2) INFORMATION FOR SEQ ID N0:20- 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
10 (D) TOPOLOGY: linear 



(xi) SEOUFNCE DESCRIPTION: SEQ ID NO: 20: 

15 CGCGAATTCC GCCACCATGA GTGCTGAGGG GTACCAGTAC 4 0 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 1 : 
CGCGGATCCT GTCGCCTCTG CTGTGCATAT AC 3 2 

30 (2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



40 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: p85-top-C 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
GGGAGATCTA TGAGTGCTGA GGGGTACCAG 3 0 

45 (2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CPIARACTERISTICS : 



;x;i/ SEQUENCE DESCRIPTION: SEO ID r:0:23 
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(2) INFORMATION POH SEQ XD NO: 24.. 

5 SEQUENCE CHARACTERISTICS 

(A) LENGTH- 33 >,= '"■^''"CS: 

(B) TYPP : pairs 

IC) STRANDEDNESS • sinm 
TOPOLOGY: lineaj 

10 

^^i) .,3CKIPTI0N: SEO 10 NO.. 

OTOAATTCGA CCATGTCGTC CATCTTGCCA TTC ^ ^ 

(2) INFORMATION FOR SEQ m hO:.5: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 base n! 

(B) TYPP. r, t pairs 
^° r c^f nucleic acid 

=-™CCC» ^^TOCTT ...c«a=„ c 



30 



35 



ORMATION FOR SEQ ID NO:26: 



(i) SEQUENCE CHARACTERISTIC. 
(A) LENGTH- 32 k=,» = 

TVPE: nuclexc :c?:'"^ 
(C) STRANDEDNESS sSfl 
TOPOLOGY: iLa" 



SEQUENCE DESCRIPTION: SEQ id .0..., 
<0 --^TOCT TCAGCAACGC AC 

^^^°«MATION POR SEQ id NO:27: 

(i) SEQUENCE CHARACTERISTICS 
1= (A) LENGTH- 31 "-^"TICS: 

(B) TYPR , P^irs 



55 ITO^TIOK POR SEO ID KO:28 



' SEQUENCE CH.^PAc-r^x^^,. 
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(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
GTGGATCCCT GCTGCTTCCG GTGGAGTTCG 

(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS 
(A) LENGTH: 31 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
GTGGATCCCT AGCTGCTTCC GGTGGAGTTC G 

(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
35 GTAGATCTAC CATGGCGGGC TGGATCCAGG CC 

(2) INFORMATION FOR SEQ ID NO : 3 1 : 



(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 
GTGGTACCCA TGAGAGGGAG CCTCTH^.--. ' 
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..iiNGTH : jl case pairs 
:3) TYPE: nucleic acid 
;C) STRANDEDNESS: single 
(D) TOPOLOGY : : : ^ > 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GTGGTACCTC ATGAGAGGGA GCCTCTGGCA G 3 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
GTGAATTCAA CCATGGACAA TATGTCTATT ACG 33 
(2) INFORMATION FOR SEQ ID NO: 34: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GTGGATCCCA GTCTAAAGGT TGTGGGTCTG C 31 
(2) INFORMATION FOR SEQ ID NO: 35: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
GTGGATCCTC AGTCTAAAGG TTGTGGGTCT GC 3 2 

(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTEPT,^-^^^-' 
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XI) SEQUENCE DESCRIPTION: SEQ TD 
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GTCTCGAGGC ACCATGAGCG ACGTGGC 27 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 37: 
TGGGATCCGA GGCCGTGCTG CTGGCCG 2 7 

(2) INFORMATION FOR SEQ ID NO : 3 8 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1896 base pairs 

20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
25 (ix) FEATURE: 

(A) NAME/ KEY: Coding Sequence 
(E) LOCATION: 1...1891 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

35 1 5 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 9 6 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

40 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 14 4 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

45 TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



r-1 



r-i f-^ 



GAG 
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IS Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

8 5 9 0 
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CGC ACC ATC TTC TTC AAG GAG GAC GGC AAC TAG AAG ACC CGC GCC GAG 33 6 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

5 GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 43 2 

10 He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Gei His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
15 145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

20 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

25 CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 62 4 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
30 Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 72 0 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
35 225 230 235 240 

GGA CTC AGA TCT CGA GCT CAA GCT TCG AAT TCA ACC ATG GCG GCG GCG 768 
Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Thr Met Ala Ala Ala 

245 250 255 

40 

GCG GCT CAG GGG GGC GGG GGC GGG GAG CCC CGT AGA ACC GAG GGG GTC 816 
Ala Ala Gin Gly Gly Gly Gly Gly Glu Pro Arg Arg Thr Glu Gly Val 

260 265 270 

45 GGC CCG GGG GTC CCG GGG GAG GTG GAG ATG GTG AAG GGG CAG CCG TTC 8 64 

Gly Pro Gly Val Pro Gly Glu Val Glu Met Val Lys Gly Gin Pro Phe 
275 280 285 



••^ - 1.^7 ^ J-^ 1 J. U v_ ^ v„ >\>A\_j ^-j^ ^_7.l>^ 

Tyr Gly Met Val Ser Ser Ala Tyr Asp His Val Arg Lys Thr Arg Val 
55 3 05 3 10 3 15 
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68 



GCC ATC AAG AAG ATC AGC CCC TTC GAA CAT CAG ACQ TAC TGC CAG CGC 
Ala lie Lys Lys He Ser Pro Phe Glu His Gin Thr Tyr Cys Sn Arg 

3 25 



"0 335 



ACG CTC CGG GAG ATC CAG ATC CTG CTG CGC TTC CGC CAT GAG AAT GTC 
Thr Leu Arg Glu He Gin He Leu Leu Arg Phe Arg His G^u Z vll 

345 

ATC GGC ATC CGA GAC ATT CTG CGG GCG TCC ACC CTG GAA GCC ATG AGA 
He Gly I e Arg Asp He Leu Arg Ala Ser Thr Leu Glu Ala Zt Arg 

GAT GTC TAC ATT GTG CAU GAC CTG ATG GAG ACT GAC CTG TAC AAG TTG 
ASP v.. xyr lie Val Gin Asp Leu Met Glu Thr Asp Leu Tyr Lys Leu 



375 



380 



CTG AAA AGC CAG CAG CTG AGC AAT GAC CAT ATC TGC TAC TTC CTC TAC 
Leu Lys Ser Gin Gin Leu Ser Asn Asp His He Cys Tyr Phe Leu T^r 

395 Jo 

CAG ATC CTG CGG GGC CTC AAG TAC ATC CAC TCC GCC AAC GTG CTC CAC 
Gin He Leu Arg Gly Leu Lys Tyr He His Ser Ala Asn Val Leu His 

415 



405 410 



CGA GAT CTA AAG CCC TCC AAC CTG CTC AGC AAC ACC ACC TGC GAC CTT 
Arg Asp Leu Lys Pro Ser Asn Leu Leu Ser Asn Thr Thr Cys Asp Leu 

430 



420 



AAG ATT TGT GAT TTC GGC CTG GCC CGG ATT GCC GAT CCT GAG CAT GAC 
Lys He Cys Asp Phe Gly Leu Ala Arg He Ala Asp Pro Glu His Asp 
435 440 

CAC ACC GGC TTC CTG ACG GAG TAT GTG GCT ACG CGC TGG TAC CGG GCC 
Jhr Gly Phe Leu Thr Glu Tyr Val Ala Thr Arg Trp Tyr Arg Ala 

455 460 

CCA GAG ATC ATG CTG AAC TCC AAG GGC TAT ACC AAG TCC ATC GAC ATC 
Pro Glu He Met Leu Asn Ser Lys Gly Tyr Thr Lys Ser He Asp He 

475 480 

TGG TCT GTG GGC TGC ATT CTG GCT GAG ATG CTC TCT AAC CGG CCC ATC 
Trp Ser Val Gly Cys He Leu Ala Glu Met Leu Ser Asn Arg Pro He 

485 490 

TTC CCT GGC AAG CAC TAC CTG GAT CAG CTC AAC CAC ATT CTG GGC ATC 
Phe Pro Gly Lys His Tyr Leu Asp Gin Leu Asn His He Leu Gly r» 

500 505 



1008 



105S 



1104 



1152 



1200 



1248 



1 ? 0^: 

^ J \j 



1344 



1392 



1440 



1488 



1536 



Ala 



TCT CTG CCC TCC AAG ACC AAG GTG GCT TGG :63 



rg Asn Tyr I^eu Gin Ser Leu Pro Ser Lys Thr Lvr Val A 



530 
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GCC AAG CTT TTC CCC AAG TCA GAG TCC AAA GCC CTT GAC CTG CTG GAG 1680 

Ala Lys Leu Phe Pro Lys Ser Asp Ser Lys Ala Leu Asp Leu Leu Asp 
545 550 555 560 

5 CGG ATG TTA ACC TTT AAC CCC AAT AAA CGG ATC ACA GTG GAG GAA GCG 172 8 
Arg Met Leu Thr Phe Asn Pro Asn Lys Arg lie Thr Val Glu Glu Ala 

565 570 575 

CTG GOT CAC CCC TAC CTG GAG CAG TAG TAT GAC CCG ACG GAT GAG CCA 17 76 
10 Leu Ala His Pro Tyr Leu Glu Gin Tyr Tyr Asp Pro Thr Asp Glu Pro 

580 585 590 

GTG GCC GAG GAG CCC TTC ACC TTC GCC ATG GAG CTG GAT GAC CTA CCT 182 4 

Val Ala Glu Glu Pro Phe Thr Phe Ala Met Glu Leu Asp Asp Leu Pro 
15 595 500 605 

AAG GAG CGG CTG AAG GAG CTG ATC TTC CAG GAG ACA GCA CGC TTC CAG 18 72 

Lys Glu Arg Leu Lys Glu Leu He Phe Gin Glu Thr Ala Arg Phe Gin 
610 615 620 



20 



25 



CCC GGA GTG CTG GAG GCC C CCTAG 18 96 

Pro Gly Val Leu Glu Ala Pro 
625 630 



(2) INFORMATION FOR SEQ ID NO: 39: 

{i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 631 amino acids 
30 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
35 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 9 : 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
40 1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met !;Vf- 
6 5 7 0 

— 1 r ' f ^' ^ • - - 



45 



-- - ' : ^'^-j.^ l::!" l.e*j Val Acn Arg lie Glu Leu Ly3 

115 12C 125 

55 He Asp Phe Lys Glu Asp Gly Asn He Leu Gly ¥A v. Lv? T r.-; ; 

13 0 : ^ ^ 
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Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 

165 170 175 

5 Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
10 210 215 220 

Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Gin Ala Gei Asn Ser Thr Met Ala Ala Ala 

245 250 255 

15 Ala Ala Gin Gly Gly Gly Gly Gly Glu Pro Arg Arg Thr Glu Gly Val 

260 265 270 

Gly Pro Gly Val Pro Gly Glu Val Glu Met Val Lys Gly Gin Pro Phe 

275 280 285 

Asp Val Gly Pro Arg Tyr Thr Gin Leu Gin Tyr lie Gly Glu Gly Ala 
20 290 295 300 

Tyr Gly Met Val Ser Ser Ala Tyr Asp His Val Arg Lys Thr Arg Val 
305 310 315 320 

Ala lie Lys Lys lie Ser Pro Phe Glu His Gin Thr Tyr Cys Gin Arg 

325 330 335 

25 Thr Leu Arg Glu lie Gin lie Leu Leu Arg Phe Arg His Glu Asn Val 

340 345 350 

lie Gly lie Arg Asp lie Leu Arg Ala Ser Thr Leu Glu Ala Met Arg 

355 360 365 

Asp Val Tyr He Val Gin Asp Leu Met Glu Thr Asp Leu Tyr Lys Leu 
30 370 375 380 

Leu Lys Ser Gin Gin Leu Ser Asn Asp His He Cys Tyr Phe Leu Tyr 
385 390 395 400 

Gin He Leu Arg Gly Leu Lys Tyr He His Ser Ala Asn Val Leu His 

405 410 415 

35 Arg Asp Leu Lys Pro Ser Asn Leu Leu Ser Asn Thr Thr Cys Asp Leu 

420 425 430 

Lys He Cys Asp Phe Gly Leu Ala Arg He Ala Asp Pro Glu His Asp 

435 440 445 

His Thr Gly Phe Leu Thr Glu Tyr Val Ala Thr Arg Trp Tyr Arg Ala 
40 450 455 460 

Pro Glu He Met Leu Asn Ser Lys Gly Tyr Thr Lys Ser He Asp He 
465 470 475 480 

Trp Ser Val Gly Cys He Leu Ala Glu Met Leu Ser Asn Arg Pro He 

485 490 495 

45 Phe Pro Gly Lys His Tyr Leu Asp Gin Leu Asn His He Leu Gly He 

500 505 510 

Leu Gly Ser Pro Ser Gin Glu Asp Leu Asn Cys lie Tie Ar^- ^ 
515 



Sb:: 5 6 C 

r.r i"ne Acr. F-ro Agt. Lys Arg He Thr Val Glu Glu Ala 
565 570 575 



55 Leu Ala His Pro Tyr Leu Glu Glr. 



c q .-^ 
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Val Ala Glu Glu Pro Phe Thr Phe Ala Met Glu Leu Asp Asp Leu Pro 

595 600 605 

Lys Glu Arg Leu Lys Glu Leu He Phe Gin Glu Thr Ala Arg Phe Gin 
610 615 620 

5 Pro Gly Val Leu Glu Ala Pro 
625 630 

(2) INFORMATION FOR SEQ ID NO: 40: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1818 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 
20 (B) LOCATION: 1...1815 

(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

25 ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
30 Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

40 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 24 0 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

65 70 75 80 

45 CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

P^'"""' ^T^'"' -^-r-^ . . 



O ^^.3 tAj-^^j, bvjC- Jb4 



'Jai Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
55 115 12C 125 
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ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 4 32 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

5 AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

10 Gly lie Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 5 76 

. Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
15 180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

20 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

25 GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 72 0 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GTA ACC ATG GCG GCG GCG GCG GCG GCG GGC CCG 76 8 

30 Gly Leu Arg Ser Arg Val Thr Met Ala Ala Ala Ala Ala Ala Gly Pro 

245 250 255 

GAG ATG GTC CGC GGG CAG GTG TTC GAC GTG GGG CCG CGC TAC ACT AAT 816 
Glu Met Val Arg Gly Gin Val Phe Asp Val Gly Pro Arg Tyr Thr Asn 
35 260 265 270 

CTC TCG TAC ATC GGA GAA GGC GCC TAC GGC ATG GTT TGT TCT GCT TAT 8 64 

Leu Ser Tyr He Gly Glu Gly Ala Tyr Gly Met Val Cys Ser Ala Tyr 
275 280 285 

40 

GAT AAT CTC AAC AAA GTT CGA GTT GCT ATC AAG AAA ATC AGT CCT TTT 912 

Asp Asn Leu Asn Lys Val Arg Val Ala He Lys Lys He Ser Pro Phe 
290 295 300 

45 GAG CAC CAG ACC TAC TGT CAG AGA ACC CTG AGA GAG ATA AAA ATC CTA 96 0 

Glu His Gin Thr Tyr Cys Gin Arg Thr Leu Arg Glu He Lys He Leu 
305 310 315 320 

^ rji ^ ^ ^/-^ ^ .-^ , ^ ■ • ^ 



55 



A . a Pre 



lie Glu Gin Ke' 
34 0 



Lys Asp 
345 



JIM 1^.- M.^A ^ i J-^ ^ V- 

Val Tyr He Val Gin Asp Leu 

3 50 



105 6 
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ATG GAG ACA GAT CTT TAC AAG CTC TTG AAG ACA CAG CAC CTC AGC AAT 1104 
Met Glu Thr Asp Leu Tyr Lys Leu Leu Lys Thr Gin His Leu Ser Asn 
355 360 365 

5 GAT CAT ATC TGC TAT TTT CTT TAT CAG ATC CTG AGA GGA TTA AAG TAT 1152 
Asp His He Cys Tyr Phe Leu Tyr Gin He Leu Arg Gly Leu Lys Tyr 
370 375 380 

ATA CAT TCA GCT AAT GTT CTG CAC CGT GAC CTC AAG CCT TCC AAC CTC 12 0 0 

10 He His Ser Ala Asn Val Leu His Arg Asp Leu Lys Pro Ser Asn Leu 
385 390 395 400 

CTG CTG AAC ACC ACT TGT GAT CTC AAG ATC TGT GAC TTT GGC CTT GCC 12 4 8 

Leu Leu Aph Thr Thx" Cys Asp Leu Lys He Cys Asp Phe Gly Leu Ala 
15 405 410 415 

CGT GTT GCA GAT CCA GAC CAT GAT CAT ACA GGG TTC TTG ACA GAG TAT 12 96 

Arg Val Ala Asp Pro Asp His Asp His Thr Gly Phe Leu Thr Glu Tyr 

420 425 430 

20 

GTA GCC ACG CGT TGG TAC AGA GCT CCA GAA ATT ATG TTG AAT TCC AAG 13 44 

Val Ala Thr Arg Trp Tyr Arg Ala Pro Glu He Met Leu Asn Ser Lys 
435 440 445 

25 GGT TAT ACC AAG TCC ATT GAT ATT TGG TCT GTG GGC TGC ATC CTG GCA 13 92 
Gly Tyr Thr Lys Ser He Asp He Trp Ser Val Gly Cys He Leu Ala 
450 455 460 

GAG ATG CTA TCC AAC AGG CCT ATC TTC CCA GGA AAG CAT TAC CTT GAC 144 0 
30 Glu Met Leu Ser Asn Arg Pro He Phe Pro Gly Lys His Tyr Leu Asp 
465 470 475 480 

CAG CTG AAT CAC ATC CTG GGT ATT CTT GGA TCT CCA TCA CAG GAA GAT 14 8 8 

Gin Leu Asn His He Leu Gly He Leu Gly Ser Pro Ser Gin Glu Asp 
35 485 490 495 

CTG AAT TGT ATA ATA AAT TTA AAA GCT AGA AAC TAT TTG CTT TCT CTC 15 36 

Leu Asn Cys He He Asn Leu Lys Ala Arg Asn Tyr Leu Leu Ser Leu 

500 505 510 

40 

CCG CAC AAA AAT AAG GTG CCG TGG AAC AGG TTG TTC CCA AAC GCT GAC 15 84 

Pro His Lys Asn Lys Val Pro Trp Asn Arg Leu Phe Pro Asn Ala Asp 
515 520 525 

45 TCC AAA GCT CTG GAT TTA CTG GAT AAA ATG TTG ACA TTT AAC CCT CAC 16 3 2 

Ser Lys Ala Leu Asp Leu Leu Asp Lys Met Leu Thr Phe Asn Pro His 
530 535 540 



AAG A^"^- 



55 



r Asp Pro Ser Asp Glu Fro lie Aia 
565 570 



Ala Pro Phe Lys Phe 
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GAC ATG GAG CTG GAG GAG TTA CCT AAG GAG AAG CTC AAA GAA CTC ATT 177 6 

Asp Met Glu Leu Asp Asp Leu Pro Lys Glu Lys Leu Lys Glu Leu lie 

580 585 590 

TTT GAA GAG ACT GCT CGA TTC GAG CCA GGA TAG AGA TCT TAA 1818 
Phe Glu Glu Thr Ala Arg Phe Gin Pro Gly Tyr Arg Ser 
595 600 605 



10 (2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 05 amino acids 

(B) TYPE, amino acid 

15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



20 



25 



30 



35 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 



Met 


Val 


Ser 


Lys 


Gly 


Glu 


Glu 


Leu 


Phe 


Thr 


Gly Val 


Val 


Pro 


He 


Leu 


1 








5 










10 










15 




Val 


Glu 


Leu 


Asp 


Gly 


Asp 


Val 


Asn 


Gly His 


Lys 


Phe 


Ser 


Val 


Ser 


Gly 








20 










25 










30 






Glu 


Gly 


Glu 


Gly 


Asp 


Ala 


Thr 


Tyr 


Gly 


Lys 


Leu 


TVi V 


Leu 


Lya 


Phe 


He 






35 










40 










45 








Cys 


Thr 


Thr 


Gly 


Lys 


Leu 


Pro 


Val 


Pro 


Trp 


Pro 


Thr 


Leu 


Val 


Thr 


Thr 




50 










55 










60 










Leu 


Thr 


Tyr 


Gly 


Val 


Gin 


Cys 


Phe 


Ser 


Arg 


Tyr 


Pro 


Asp 


His 


Met 


Lys 


65 










70 










75 










80 


Gin 


His 


Asp 


Phe 


Phe 


Lys 


Ser 


Ala 


Met 


Pro 


Glu 


Gly Tyr 


Val 


Gin 


Glu 










85 










90 










95 




Arg 


Thr 


He 


Phe 


Phe 


Lys 


Asp 


Asp 


Gly Asn 


Tyr 


Lys 


Thr 


Arg 


Ala 


Glu 








100 










105 










110 






Val 


Lys 


Phe 


Glu 


Gly Asp 


Thr 


Leu 


Val 


Asn 


Arg 


He 


Glu 


Leu 


Lys 


Gly 






115 










120 










125 








He 


Asp 


Phe 


Lys 


Glu 


Asp 


Gly 


Asn 


He 


Leu 


Gly 


His 


Lys 


Leu 


Glu 


Tyr 




130 










135 










140 










Asn 


Tyr 


Asn 


Ser 


His 


Asn 


Val 


Tyr 


He 


Met 


Ala 


Asp 


Lys 


Gin 


Lys 


Asn 


145 










150 










155 










160 


Gly 


He 


Lys 


Val 


Asn 


Phe 


Lys 


He 


Arg 


His 


Asn 


He 


Glu 


Asp 


Gly 


Ser 










165 










170 










175 




Val 


Gin 


Leu 


Ala 


Asp 


His 


Tyr 


Gin 


Gin 


Asn 


Thr 


Pro 


He 


Gly 


Asp 


Gly 








180 










185 










190 






Pro 


Val 


Leu 


Leu 


Pro 


Asp 


Asn 


His 


Tyr 


Leu 


Ser 


Thr 


Gin 


Ser 


Ala 


Leu 






195 










200 










"1 r 









































5^^ 



. ■ - . ■ ^ :'j - ^.i - "^i.r *^ A. a Ala A^a Ala Ala Ala Gly Pre 

245 250 255 

D GLu Met Val Arg Gly Gin Val Fhe Asp Val Gly Pro Arg Tyr 



h r A ^ 



260 
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Leu Ser Tyr lie Gly Glu Gly Ala Tyr Gly Met Val Cys Ser Ala Tyr 

275 280 285 

Asp Asn Leu Asn Lys Val Arg Val Ala lie Lys Lys lie Ser Pro Phe 
290 295 300 

5 Glu His Gin Thr Tyr Cyg Gin Arg Thr Leu Arg Glu He Lys He Leu 
305 310 315 320 

Leu Arg Phe Arg His Glu Asn He He Gly lie Asn Asp He He Arg 

325 330 335 

Ala Pro Thr He Glu Gin Met Lys Asp Val Tyr He Val Gin Asp Leu 
10 340 345 350 

Met Glu Thr Asp Leu Tyr Lys Leu Leu Lys Thr Gin His Leu Ser Asn 

355 360 365 

Asp His He Cys Tyr Phe Leu Tyr Gin He Leu Arg Gly Leu Lys Tyr 
37U 375 380 

15 He His Ser Ala Asn Val Leu His Arg Asp Leu Lys Pro Ser Asn Leu 
385 390 395 400 

Leu Leu Asn Thr Thr Cys Asp Leu Lys He Cys Asp Phe Gly Leu Ala 

405 410 415 

Arg Val Ala Asp Pro Asp His Asp His Thr Gly Phe Leu Thr Glu Tyr 
20 420 425 430 

Val Ala Thr Arg Trp Tyr Arg Ala Pro Glu He Met Leu Asn Ser Lys 

435 440 445 

Gly Tyr Thr Lys Ser He Asp He Trp Ser Val Gly Cys He Leu Ala 
450 455 460 

25 Glu Met Leu Ser Asn Arg Pro He Phe Pro Gly Lys His Tyr Leu Asp 
465 470 475 480 

Gin Leu Asn His He Leu Gly He Leu Gly Ser Pro Ser Gin Glu Asp 

485 490 495 

Leu Asn Cys He He Asn Leu Lys Ala Arg Asn Tyr Leu Leu Ser Leu 
30 500 505 510 

Pro His Lys Asn Lys Val Pro Trp Asn Arg Leu Phe Pro Asn Ala Asp 

515 520 525 

Ser Lys Ala Leu Asp Leu Leu Asp Lys Met Leu Thr Phe Asn Pro His 
530 535 540 

35 Lys Arg He Glu Val Glu Gin Ala Leu Ala His Pro Tyr Leu Glu Gin 
545 550 555 560 

Tyr Tyr Asp Pro Ser Asp Glu Pro lie Ala Glu Ala Pro Phe Lys Phe 

565 570 575 

Asp Met Glu Leu Asp Asp Leu Pro Lys Glu Lys Leu Lys Glu Leu He 
40 580 585 590 

Phe Glu Glu Thr Ala Arg Phe Gin Pro Gly Tyr Arg Ser 
595 600 605 



45 



(2) INFORMATION FOR SEQ ID NO: 42 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2529 base pairs 

(B) TYPE : r.ur^ r> ' - 



55 ;a; NAI-IE/KEY: Codma Seauencp 
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(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO;42: 

5 ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 9 6 

10 Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTC ACC CTU AAG TTC ATC 14 4 

Glu Gly Glu Cly Aap Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
15 35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

20 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 24 0 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

25 CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 28 8 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 3 36 

30 Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
35 115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 43 2 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 

130 135 140 

40 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 

145 150 155 160 

45 GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 

165 170 175 



. ■ . — _ /"^-.C, 1 AC CTG AGC A.CC CAG TCC GCC C j. G 6 2 

?ro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
55 195 2 0C 2C5 
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720 



10 



768 



AGC AAA GAC CCC AAr rnn 

- - - i - - s - 
i - - - - i - s s= s z s ^= 

235 '^ys Ser 

GGA CTC AGA TCT CGA GHT 

^-^ - - ^- ™ s 2 2 - 

250 ^ GIu 

ATC GTG rnr n^v^ ^55 

V.1° S ^? .™ ™ « .r. „ 

" " ^' S or;; z 

GGA AAG CGC AAA Grr a^. 

AT, J- MO AAG TGG AAA GAA ATC CTG AA. 

So n'e' Hi 

s ^^^^^ - - - .A A :j: 



672 



■1 c 



816 



912 



30 



40 



25 TAC TGC AGT TTA TGT P.n 

- - - - s s r ™ - ™ - 

""^S Leu Phe Arg 

CAG TTT TGT GAA flnr 32 0 

- - T^^ - - - -0 GAG TGT TAC ATT CAG TTC CTG 

y Leu Glu cys Tyr He cin Phe S 

OAC TCC GTG GCA GAA TAT n. 

',it - - - s s s z ^. - - - 

345 ^ ^-^^ Lys Leu Gly Qlu 

GGG AAG GAA ATT ATn 

- - - ™ S - - - ... 

3 60 ^ys Ser Pro Val 

TTC ATA GCC CAA GTT rnr^ 

J,'.' -° =t s z S S ™ - ... ^„ 

3 75 Thr Glu Glu Lys 



960 



1008 



35 



1056 



1104 



1152 



r r 



CTC CTA CAG AAG CCG Trp 
Leu Leu oin r ^ QAA CTC ttt 

3BS Uys 

3^0 Ser Ala Cys Ala Gin Ser 

GTC CAC GAG TA^ 400 



1200 



ne 



Asp 
420 



A'g Phe Le'^ Gin Trp 

425 



?n "^^^^ CC 

^rp Leu Glu Arg Gin Pr 

430 



1296 



o 
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5 



10 



GTG ACC AAA AAC ACT TTP isnr. 
val T.. . -0 ^ 

440 ^ Lys Gly Gly 

445 

TTC GGG GAG GTC TGT GCC TOP n.^ 

=1. V,. - - - ?s iz r ™ 

455 Lys Met Tvr 

460 ^ 

GCC TGC AAG CGC TTr nnn 

... - - - ^ ™ ^ „ ^ 

470 Lys Gly Qlu 

47=; ^ ^ 



1344 



1392 



475 

480 



15 



TCC ATG GCC CTC AAT PAr n.^ 
SPr Mo^ .1, , CAG ATC PTn n.^ . 

Lys Gin He Leu c7 "^"^ ^AG 

4 85 Lys Val Asn Ser Gin 



490 

495 



20 



TTT GTG GTC AAC Pth ^r,^ 

- V.I ..I ..^ '^l IT. - « M. 0.. ™ 

"° 505 ^" Asp Ala Leu Cys 

TTG GTC CTG ACC ATC iTo a.™ 

- v.: ^ ™ - - ooT 0.C 2. «a ™ 

520 ^ His He Tyr 

AAC ATG GGC AAC CCT GGC TTC r^n . 

- - «v - - - S S SI s 

S s» ™ s - - s - - - : 

550 *:=u MIS Arg Glu Asn Thr Val 



555 

560 



TAC CGA GAT CTG AAA 

Arg Asp .eu Lys ttl aTu Z Ue Su '^'^ ^'^^ ™^ 

565 t"^^ Asp Asp Tyr Gly His 



570 

575 



- - 2 z I- s ™ - - 

lie Pro Glu Gly fl^p 

- - fs 0=?^ s,^ =r r ™ - - - 

600 " ^" V.l 

- r - - - - ™, 1" 

^ Asp Tvr Trn ri,, T 

615 ^ Gly Leu Gly 

620 ^ 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



1824 



1872 



^ys Glu Ly 



V 



Sr: Sr I;: f ^ ^-^^ 

645 ^^'^ Arg Val Leu Glu Th- 

650 

(T c r 



1968 
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GAG GAG GTG TAG TCC CAC AAG TTC TCC GAG GAG GCC AAG TCC ATC TGC 2 016 

Glu Glu Val Tyr Ser His Lys Phe Ser Glu Glu Ala Lys Ser lie Cys 

660 665 670 

AAG ATG CTG CTC ACG AAA GAT GCG AAG GAG AGG CTG GGC TGC CAG GAG 2 0 64 
Lys Met Leu Leu Thr Lys Asp Ala Lys Gin Arg Leu Gly Cys Gin Glu 
675 680 685 

GAG GGG GCT GCA GAG GTC AAG AGA CAC CCC TTC TTC AGG AAC ATG AAC 2112 
Glu Gly Ala Ala Glu Val Lys Arg His Pro Phe Phe Arg Asn Met Aen 
690 695 700 



TTC AAG CGC TTA GAA GCC GGG ATG TTG GAC CCT CCC TTC GTT CCA GAC 2160 
Phe Lys Arg Leu Glu Ala Gly Met Leu Asp Pro Pro Phe Val Pro Asp 
15 705 710 715 720 

CCC CGC GCT GTG TAC TGT AAG GAC GTG CTG GAC ATC GAG CAG TTC TCC 22 0 8 

Pro Arg Ala Val Tyr Cys Lys Asp Val Leu Asp lie Glu Gin Phe Ser 

725 730 735 

20 

ACT GTG AAG GGC GTC AAT CTG GAC CAC ACA GAC GAC GAC TTC TAC TCC 22 5 6 

Thr Val Lys Gly Val Asn Leu Asp His Thr Asp Asp Asp Phe Tyr Ser 

740 745 750 

25 AAG TTC TCC ACG GGC TCT GTG TCC ATC CCA TGG CAA AAC GAG ATG ATA 23 04 
Lys Phe Ser Thr Gly Ser Val Ser lie Pro Trp Gin Asn Glu Met lie 
755 760 765 

GAA ACA GAA TGC TTT AAG GAG CTG AAC GTG TTT GGA CCT AAT GGT ACC 2 3 52 
30 Glu Thr Glu Cys Phe Lys Glu Leu Asn Val Phe Gly Pro Asn Gly Thr 
770 775 780 

CTC CCG CCA GAT CTG AAC AGA AAC CAC CCT CCG GAA CCG CCC AAG AAA 24 0 0 
Leu Pro Pro Asp Leu Asn Arg Asn His Pro Pro Glu Pro Pro Lys Lys 
35 785 790 795 800 

GGG CTG CTC CAG AGA CTC TTC AAG CGG CAG CAT CAG AAC AAT TCC AAG 2 44 8 

Gly Leu Leu Gin Arg Leu Phe Lys Arg Gin His Gin Asn Asn Ser Lys 

805 810 815 

40 

AGT TCG CCC AGC TCC AAG ACC AGT TTT AAC CAC CAC ATA AAC TCA AAC 24 9 6 

Ser Ser Pro Ser Ser Lys Thr Ser Phe Asn His His lie Asn Ser Asn 

820 825 830 

45 CAT GTC AGC TCG AAC TCC ACC GGA AGC AGC TAG 2 52 9 

His Val Ser Ser Asn Ser Thr Gly Ser Ser 
835 840 



^ t » 1 n ! o *-* ^ m 1 ci c ^ ! 
(3) TYPE: ar^^mo acid 
55 (C) STRANDEDNESS : single 

(D) TOPOLOGY : ■! nr>,T > 
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(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
10 20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pre Thr Leu Val Thr Thr 
50 55 60 

15 Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

as 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
20 100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

25 Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
30 180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

35 Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Ser Met Glu Leu Glu 

245 250 255 

Asn He Val Ala Asn Thr Val Leu Leu Lys Ala Arg Glu Gly Gly Gly 
40 260 265 270 

Gly Lys Arg Lys Gly Lys Ser Lys Lys Trp Lys Glu He Leu Lys Phe 

275 280 285 

Pro His He Ser Gin Cys Glu Asp Leu Arg Arg Thr He Asp Arg Asp 
290 295 300 

45 Tyr Cys Ser Leu Cys Asp Lys Gin Pro He Gly Arg Leu Leu Phe Arg 
305 310 315 320 

Gin Phe Cys Glu Thr Arg Pro Gly Leu Glu Cys Tyr He Gin Phe Le\: 

325 



55 



....... ^ ^.y .J^n Asp Leu Val Ser Gin Thr Glu Glu Lys 

37C 375 360 

Leu Leu Gin Lys Pro Cys Lys Glu Leu Phe Ser Ala Cyn A^. a 

3 85 3^" 
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Val His Glu Tyr Leu Arg Gly Glu Pro Phe His Glu Tyr Leu Asp Ser 

405 410 415 

Met Phe Phe Asp Arg Phe Leu Gin Trp Lys Trp Leu Glu Arg Gin Pro 

420 425 430 

5 Val Thr Lys Asn Thr Phe Arg Gin Tyr Arg Val Leu Gly Lys Gly Gly 

435 440 445 

Phe Gly Glu Val Cys Ala Cys Gin Val Arg Ala Thr Gly Lys Met Tyr 

450 455 460 

Ala Cys Lys Arg Leu Glu Lys Lys Arg lie Lys Lys Arg Lys Gly Glu 
10 465 470 475 480 

Ser Met Ala Leu Asn Glu Lys Gin lie Leu Glu Lys Val Asn Ser Gin 

485 490 495 

Phe Val Val Asn Leu Ala Tyr Ala Tyr Glu Thr Lye Asp Ala Leu Cys 

500 505 510 

15 Leu Val Leu Thr lie Met Asn Gly Gly Asp Leu Lys Phe His He Tyr 

515 520 525 

Asn Met Gly Asn Pro Gly Phe Glu Glu Glu Arg Ala Leu Phe Tyr Ala 

530 535 540 

Ala Glu lie Leu Cys Gly Leu Glu Asp Leu His Arg Glu Asn Thr Val 
20 545 550 555 560 

Tyr Arg Asp Leu Lys Pro Glu Asn He Leu Leu Asp Asp Tyr Gly His 

565 570 575 

He Arg He Ser Asp Leu Gly Leu Ala Val Lys He Pro Glu Gly Asp 

580 585 590 

25 Leu He Arg Gly Arg Val Gly Thr Val Gly Tyr Met Ala Pro Glu Val 

595 600 605 

Leu Asn Asn Gin Arg Tyr Gly Leu Ser Pro Asp Tyr Trp Gly Leu Gly 

610 615 620 

Cys Leu He Tyr Glu Met He Glu Gly Gin Ser Pro Phe Arg Gly Arg 
30 625 630 635 640 

Lys Glu Lys Val Lys Arg Glu Glu Val Asp Arg Arg Val Leu Glu Thr 

645 650 655 

Glu Glu Val Tyr Ser His Lys Phe Ser Glu Glu Ala Lys Ser He Cys 

660 665 670 

35 Lys Met Leu Leu Thr Lys Asp Ala Lys Gin Arg Leu Gly Cys Gin Glu 

675 680 685 

Glu Gly Ala Ala Glu Val Lys Arg His Pro Phe Phe Arg Asn Met Asn 

690 695 700 

Phe Lys Arg Leu Glu Ala Gly Met Leu Asp Pro Pro Phe Val Pro Asp 
40 705 710 715 720 

Pro Arg Ala Val Tyr Cys Lys Asp Val Leu Asp He Glu Gin Phe Ser 

725 730 735 

Thr Val Lys Gly Val Asn Leu Asp His Thr Asp Asp Asp Phe Tyr Ser 

740 745 750 

45 Lys Phe Ser Thr Gly Ser Val Ser He Pro Trp Gin Asn Glu Met He 

755 760 765 

Glu Thr Glu Cys Phe Lys Glu Leu Asn Val Phe Gly Pro Asn Gly Th- 

77 0 77 5 

Leu Pro r^-^ -^^^ 



55 



1-4 2_ 



820 

/al Ser Ser Asn Ser Thr Gl- 



825 

Ser Ser 



His lie Asn Ser Asn 
830 



835 



n A r. 
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10 



15 



20 



(2) WMATIONPORSEQION0..44: 

(i) SEQUENCE CHARACTERISTICS. 
A) LENGTH: 1902 base L 
II ™: nucleic ac./'^" 
C STRANDEDNESS : single 
(D) TOPOLOGY: linear 

Jii) MOLECULE TYPE: cDNA 
(IX) FEATURE: 

(A) NAME/KEY- roH-i 
fR) r«^,lrr • ^"^^'^S Sequence 

W OTHER INFORMATION: 
'^i) SEOUEKCE HESCHXPTION: SEQ I. .0:44 



25 



30 



ATG GTG AGC AAG GGC GAG GAG CTG TTC ^.n 
^ -V Gl. Gl. - - - -G GTG C^c ATC CTG 

10 •'■J-e Leu 

GTC GAG CTG GAC Grr r^r. 

- - - r - - - 2? - - s 

GAG GGC GAG GGC GAT GCO =nn 

Glu G]v r.i„ ^ '^'^^ ACC TAC GGC „ 

' 3-r - ™' ^ l;: - p - - ™ 

- - - - s.^ s s s - r= - - - 

55 ^ ^"""^ i^eu val Thr Thr 

fin "-^ 



35 CTG ACC TAC GGC GTr n.r. 

- - ?s 1:1 z z iti :t - ™ « 

70 His Met Lys 

CAG CAC GAC TTC TTP ^° 

- - - :r 2 s :2 s - - . 

CGC ACC ATC TTC TTr 

- a. . ... - - «e ^ 

105 ^ Ala Glu 

- 2.= - - - - ccc „c 



40 



45 



48 



96 



144 



192 



240 



288 



336 



- J D 



5^ 



'■^C TAG AAC AGC 



140 



Asn Tyr Asn 



Ser His Asr. 



AAC GTC TAT AT 



V- 1 



C ATG GCC GAC AAG 
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145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
5 165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 57 6 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 

180 185 190 

10 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 20b 

15 AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 72 0 

20 Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 

225 230 235 240 

GGA CTC AGA TCT CGA GCT CGA GCC ATC ATG AGC AGA AGC AAG CGT GAC 768 

Gly Leu Arg Ser Arg Ala Arg Ala lie Met Ser Arg Ser Lys Arg Asp 
25 245 250 255 

AAC AAT TTT TAT AGT GTA GAG ATT GGA GAT TCT ACA TTC ACA GTC CTG 816 

Asn Asn Phe Tyr Ser Val Glu lie Gly Asp Ser Thr Phe Thr Val Leu 

260 265 270 

30 

AAA CGA TAT CAG AAT TTA AAA CCT ATA GGC TCA GGA GCT CAA GGA ATA 8 64 

Lys Arg Tyr Gin Asn Leu Lys Pro lie Gly Ser Gly Ala Gin Gly lie 
275 280 285 

35 GTA TGC GCA GCT TAT GAT GCC ATT CTT GAA AGA AAT GTT GCA ATC AAG 912 

Val Cys Ala Ala Tyr Asp Ala He Leu Glu Arg Asn Val Ala He Lys 

290 295 300 

AAG CTA AGC CGA CCA TTT CAG AAT CAG ACT CAT GCC AAG CGG GCC TAC 96 0 

40 Lys Leu Ser Arg Pro Phe Gin Asn Gin Thr His Ala Lys Arg Ala Tyr 

305 310 315 320 

AGA GAG CTA GTT CTT ATG AAA TGT GTT AAT CAC AAA AAT ATA ATT GGC 10 0 8 

Arg Glu Leu Val Leu Met Lys Cys Val Asn His Lys Asn He He Gly 
45 325 330 335 

CTT TTG AAT GTT TTC ACA CCA CAG AAA TCC CTA GAA GAA TTT CAA HAT ^ ^ ^ 

Leu Leu Asn Val Phe Thr Pro ^-"Ir ^ •.*- ■ 



"1 (') 



365 



55 



:AG ATG GAG CTA GAT CAT GAA AGA ATG 
;ln I^et Glu Leu A.^n H:- • " ■ 



TAC CTT 
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370 375 380 

CTG TGT GGA ATC AAG CAC CTT CAT TCT GCT GGA ATT ATT CAT CGG GAC 12 00 

Leu Cys Gly lie Lys His Leu His Ser Ala Gly lie He His Arg Asp 
385 390 395 400 

TTA AAG CCC AGT AAT ATA GTA GTA AAA TCT GAT TGC ACT TTG AAG ATT 12 4 8 

Leu Lys Pro Ser Asn He Val Val Lys Ser Asp Cys Thr Leu Lys He 

405 410 415 

CTT GAC TTC GGT CTG GCC AGG ACT GCA GGA ACG AGT TTT ATG ATG ACG 12 96 

Leu Asp Phe Gly Leu Ala Arg Thr Ala Gly Thr Ser Phe Met Met Thr 

420 47^ 430 

15 CCT TAT GTA GTG ACT CGC TAC TAG AGA GCA CCC GAG GTC ATC CTT GGC 13 44 

Pro Tyr Val Val Thr Arg Tyr Tyr Arg Ala Pro Glu Val He Leu Gly 
435 440 445 

ATG GGC TAC AAG GAA AAC GTG GAT TTA TGG TCT GTG GGG TGC ATT ATG 13 92 

20 Met Gly Tyr Lys Glu Asn Val Asp Leu Trp Ser Val Gly Cys He Met 
450 455 460 

GGA GAA ATG GTT TGC CAC AAA ATC CTC TTT CCA GGA AGG GAC TAT ATT 14 4 0 

Gly Glu Met Val Cys His Lys He Leu Phe Pro Gly Arg Asp Tyr He 
25 465 470 475 480 

GAT CAG TGG AAT AAA GTT ATT GAA CAG CTT GGA ACA CCA TGT CCT GAA 1488 

Asp Gin Trp Asn Lys Val He Glu Gin Leu Gly Thr Pro Cys Pro Glu 

485 490 495 

30 

TTC ATG AAG AAA CTG CAA CCA ACA GTA AGG ACT TAC GTT GAA AAC AGA 153 6 

Phe Met Lys Lys Leu Gin Pro Thr Val Arg Thr Tyr Val Glu Asn Arg 

500 505 510 

35 CCT AAA TAT GCT GGA TAT AGC TTT GAG AAA CTC TTC CCT GAT GTC CTT 15 84 

Pro Lys Tyr Ala Gly Tyr Ser Phe Glu Lys Leu Phe Pro Asp Val Leu 
515 520 525 

TTC CCA GCT GAC TCA GAA CAC AAC AAA CTT AAA GCC AGT CAG GCA AGG 1632 

40 Phe Pro Ala Asp Ser Glu His Asn Lys Leu Lys Ala Ser Gin Ala Arg 
530 535 540 

GAT TTG TTA TCC AAA ATG CTG GTA ATA GAT GCA TCT AAA AGG ATC TCT 168 0 

Asp Leu Leu Ser Lys Met Leu Val He Asp Ala Ser Lys Arg He Ser 
45 545 550 555 560 

GTA GAT GAA GCT CTC CAA CAC CCG TAC ATC AAT GTC TGG TAT GAT ^^'^ 

Val AsD Glu Ala Leu Gin Ki? Pr-- ^ ' 



5 35 5 9C 

55 GAA AGG GAA CAC ACA ATA GAA GAG TGG AAA GAA TTO ATA ^^"^ ' ' 

Glu Arq Glu H i. s Th^ T ' ^-V, '*; 
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45 



85 

595 600 605 

GTT ATG GAC TTG GAG GAG AGA ACC AAG AAT GGA GTT ATA CGG GGG GAG 18 72 
Val Met Asp Leu Glu Glu Arg Thr Lys Asn Gly Val lie Arg Gly Gin 
610 615 620 

CCC TOT CCT TTA GCA GAG GTG GAG GAG TGA 1902 
Pro Ser Pro Leu Ala Gin Val Gin Gin 
625 630 



(2) INFORMATION FOR SEQ ID NO: 45 



(i) CEQUENCii; CHARACTERISTICS: 
15 (A) LENGTH: 633 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 



Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lya Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
30 3 5 4 0 4 5 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
40 115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp G^, v 

180 



'-■'■^■^ -^-y --ti :r.r Leu G .1 y Met Asp Glu Leu Tyr Lys Ser 

-25 23G 235 240 

55 Gly Leu Arg Ser Arg Ala Arg Ala He r^et Ser Ara Ser L\'- - v 

245 
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Asn Asn Phe Tyr Ser Val Glu lie Gly Asp Ser Thr Phe Thr Val Leu 

260 265 270 

Lys Arg Tyr Gin Asn Leu Lys Pro lie Gly Ser Gly Ala Gin Gly lie 
275 280 285 

5 Val Cys Ala Ala Tyr Asp Ala He Leu Glu Arg Asn Val Ala He Lys 
290 295 300 

Lys Leu Ser Arg Pro Phe Gin Asn Gin Thr His Ala Lys Arg Ala Tyr 
305 310 315 320 

Arg Glu Leu Val Leu Met Lys Cys Val Asn His Lys Asn He He Gly 
10 325 330 335 

Leu Leu Asn Val Phe Thr Pro Gin Lys Ser Leu Glu Glu Phe Gin Asp 

340 345 350 

Val Tyr He Val Met Glu Leu Met Asp Ala Asn Leu Cys Uln Val He 



1 C r- 

-I -i J 



360 365 



15 Gin Met Glu Leu Asp His Glu Arg Met Ser Tyr Leu Leu Tyr Gin Met 
370 375 380 

Leu Cys Gly He Lys His Leu His Ser Ala Gly He He His Arg Asp 
385 390 395 400 

Leu Lys Pro Ser Asn He Val Val Lys Ser Asp Cys Thr Leu Lys He 
20 405 410 415 

Leu Asp Phe Gly Leu Ala Arg Thr Ala Gly Thr Ser Phe Met Met Thr 

420 425 430 

Pro Tyr Val Val Thr Arg Tyr Tyr Arg Ala Pro Glu Val He Leu Gly 
435 440 445 

25 Met Gly Tyr Lys Glu Asn Val Asp Leu Trp Ser Val Gly Cys He Met 
450 455 460 

Gly Glu Met Val Cys His Lys He Leu Phe Pro Gly Arg Asp Tyr He 
465 470 475 480 

Asp Gin Trp Asn Lys Val He Glu Gin Leu Gly Thr Pro Cys Pro Glu 
30 485 490 495 

Phe Met Lys Lys Leu Gin Pro Thr Val Arg Thr Tyr Val Glu Asn Arg 

500 505 510 

Pro Lys Tyr Ala Gly Tyr Ser Phe Glu Lys Leu Phe Pro Asp Val Leu 
515 520 525 

35 Phe Pro Ala Asp Ser Glu His Asn Lys Leu Lys Ala Ser Gin Ala Arg 
530 535 540 

Asp Leu Leu Ser Lys Met Leu Val He Asp Ala Ser Lys Arg He Ser 
545 550 555 560 

Val Asp Glu Ala Leu Gin His Pro Tyr He Asn Val Trp Tyr Asp Pro 
40 565 570 575 

Ser Glu Ala Glu Ala Pro Pro Pro Lys He Pro Asp Lys Gin Leu Asp 

580 585 590 

Glu Arg Glu His Thr He Glu Glu Trp Lys Glu Leu He Tyr Lys Glu 
595 600 605 

45 Val Met Asp Leu Glu Glu Arg Thr Lys Asn Gly Val He Arg Gly Gin 
610 615 620 

Pro Ser Pro Leu Ala Gin Val Gin Gin 
625 630 



::: 1824 base pa:.r 
(B) TYPE: nucleic acid 
55 (C) STRANDEDNES3 ; single 

(D) TOPOLOGY' : : ' - 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 



PCT/DK98/00145 



10 



40 



87 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . . 1821 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 



^ 15 



15 GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 

20 Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

25 50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAG CCC GAC CAC ATG AAG 24 0 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

30 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 28B 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

35 CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 3 36 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 



GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 



ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 4 32 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 

45 130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AA-^ • - ^ 

Asn Tyr Asn Ser Hir, Arr -^^ -^ ' 



- . ^- . L - : \ ^ '.J I \ ^ I . f vjT J. ^ Id C .._7 IV ^ C r 

- -t 

-65 l-r^ ;^75 



55 GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC 

Val G: v. :.r^/: A^ ^ T\rr - - 
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180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAG CTG AGC ACC GAG TCC GCC CTG 62 4 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 72 0 

Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

15 GGA CTC AGA TCT CGA GGG AAA ATG TCT CAG GAG AGG CCC ACG TTC TAC 768 
Gly Leu Arg Ser Arg Gly Lys Met Ser Gin Glu Arg Pro Thr Phe Tyr 

245 250 255 



10 



20 



CGG CAG GAG CTG AAC AAG ACA ATC TGG GAG GTG CCC GAG CGT TAC CAG 816 
Arg Gin Glu Leu Asn Lys Thr lie Trp Glu Val Pro Glu Arg Tyr Gin 

260 265 270 



AAC CTG TCT CCA GTG GGC TCT GGC GCC TAT GGC TCT GTG TGT GCT GCT 8 64 

Asn Leu Ser Pro Val Gly Ser Gly Ala Tyr Gly Ser Val Cys Ala Ala 
25 275 280 285 

TTT GAC ACA AAA ACG GGG TTA CGT GTG GCA GTG AAG AAG CTC TCC AGA 912 

Phe Asp Thr Lys Thr Gly Leu Arg Val Ala Val Lys Lys Leu Ser Arg 

290 295 300 

30 

CCA TTT CAG TCC ATC ATT CAT GCG AAA AGA ACC TAC AGA GAA CTG CGG 96 0 

Pro Phe Gin Ser lie lie His Ala Lys Arg Thr Tyr Arg Glu Leu Arg 
305 310 315 320 

35 TTA CTT AAA CAT ATG AAA CAT GAA AAT GTG ATT GGT CTG TTG GAC GTT 10 0 8 

Leu Leu Lys His Met Lys His Glu Asn Val lie Gly Leu Leu Asp Val 

325 330 335 

TTT ACA CCT GCA AGG TCT CTG GAG GAA TTC AAT GAT GTG TAT CTG GTG 105 6 

40 Phe Thr Pro Ala Arg Ser Leu Glu Glu Phe Asn Asp Val Tyr Leu Val 

340 345 350 

ACC CAT CTC ATG GGG GCA GAT CTG AAC AAC ATT GTG AAA TGT CAG AAG 1104 

Thr His Leu Met Gly Ala Asp Leu Asn Asn lie Val Lys Cys Gin Lys 
45 355 360 365 

CTT ACA GAT GAC CAT GTT CAG TTC CTT ATC TAC CAA ATT CTC CCA G^^ ' ^ ^ 

Leu Thr Asp Asp His Val Gin Ph'= t ^ 



3 95 



4 00 



55 AGT A,^T CTA GCT GTG AAT G]\A GAC TGT GAG CTG PJ^.G A' 

Ser Asn 



Leu Ala 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 



PCT/DK98/00145 



89 

405 410 415 

GGA CTG GCT CGG CAC ACA GAT GAT GAA ATG ACA GGC TAC GTG GCC ACT 12 96 

Gly Leu Ala Arg His Thr Asp Asp Glu Met Thr Gly Tyr Val Ala Thr 

420 425 430 

AGG TGG TAC AGG GCT CCT GAG ATC ATG CTG AAC TGG ATG CAT TAC AAC 13 44 

Arg Trp Tyr Arg Ala Pro Glu He Met Leu Asn Trp Met His Tyr Asn 
435 440 445 

CAG ACA GTT GAT ATT TGG TCA GTG GGA TGC ATA ATG GCC GAG CTG TTG 13 92 

Gin Thr Val Asp He Trp Ser Val Gly Cys He Met Ala Glu Leu Leu 
450 455 4b0 

15 ACT GGA AGA ACA TTG TTT CCT GGT ACA GAC CAT ATT GAT CAG TTG AAG 144 0 

Thr Gly Arg Thr Leu Phe Pro Gly Thr Asp His He Asp Gin Leu Lys 
465 470 475 480 



10 



20 



40 



CTC ATT TTA AGA CTC GTT GGA ACC CCA GGG GCT GAG CTT TTG AAG AAA 14 8 8 

Leu He Leu Arg Leu Val Gly Thr Pro Gly Ala Glu Leu Leu Lys Lys 

485 490 495 



ATC TCC TCA GAG TCT GCA AGA AAC TAT ATT CAG TCT TTG ACT CAG ATG 15 3 6 

He Ser Ser Glu Ser Ala Arg Asn Tyr He Gin Ser Leu Thr Gin Met 

25 500 505 510 

CCG AAG ATG AAC TTT GCG AAT GTA TTT ATT GGT GCC AAT CCC CTG GCT 15 84 

Pro Lys Met Asn Phe Ala Asn Val Phe He Gly Ala Asn Pro Leu Ala 
515 520 525 

30 

GTC GAC TTG CTG GAG AAG ATG CTT GTA TTG GAC TCA GAT AAG AGA ATT 163 2 

Val Asp Leu Leu Glu Lys Met Leu Val Leu Asp Ser Asp Lys Arg He 

530 535 540 

35 ACA GCG GCC CAA GCC CTT GCA CAT GCC TAC TTT GCT CAG TAC CAC GAT 16 8 0 

Thr Ala Ala Gin Ala Leu Ala His Ala Tyr Phe Ala Gin Tyr His Asp 

545 550 555 560 



CCT GAT GAT GAA CCA GTG GCC GAT CCT TAT GAT CAG TCC TTT GAA AGC 172 8 

Pro Asp Asp Glu Pro Val Ala Asp Pro Tyr Asp Gin Ser Phe Glu Ser 

565 570 575 



AGG GAC CTC CTT ATA GAT GAG TGG AAA AGC CTG ACC TAT GAT GAA GTC 17 7 6 

Arg Asp Leu Leu He Asp Glu Trp Lys Ser Leu Thr Tyr Asp Glu Val 

45 580 585 590 

ATC AGC TTT GTG CCA CCA CCC CTT GAC CAA GAA GAG ATG GAG TCC T^^A 

He Ser Phe Val Pro Pro P^^ ^ " ' 



■i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 607 ami-c acids 

'3) TYPE: r^.^^.--." 
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(C) STRAmEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
5 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
10 1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly LyG Lea Thr Leu Lys Phe lie 
35 40 45 

15 Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
20 65 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

25 lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lis Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
30 165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

35 Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Gly Lys Met Ser Gin Glu Arg Pro Thr Phe Tyr 
^0 245 250 255 

Arg Gin Glu Leu Asn Lys Thr He Trp Glu Val Pro Glu Arg Tyr Gin 

260 265 270 

Asn Leu Ser Pro Val Gly Ser Gly Ala Tyr Gly Ser Val Cys Ala Ala 
275 280 285 

45 Phe Asp Thr Lys Thr Gly Leu Arg Val Ala Val Lys Lys Leu Ser Arg 
290 295 300 

Pro Phe Gin Ser He He His Ala Lys Arg Thr Tyr Arg Glu Leu Arc: 
305 310 



■ A^.i Asp Leu Asn Agh He Val Lys Cys Gin Lys 

3 55 36C 365 

55 Leu Thr Asp Asp His Val Gin Phe Leu He Tvr Gin :1 
370 1 
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Leu Lys Tyr lie His Ser Ala Asp He He His Arg Asp Leu Lya Pro 
385 390 395 400 

Ser Asn Leu Ala Val Asn Glu Asp Cys Glu Leu Lys He Leu Asp Phe 

405 410 415 

5 Gly Leu Ala Arg His Thr Asp Asp Glu Met Thr Gly Tyr Val Ala Thr 

420 425 430 

Arg Trp Tyr Arg Ala Pro Glu He Met Leu Asn Trp Met His Tyr Asn 

435 440 445 

Gin Thr Val Asp He Trp Ser Val Gly Cys He Met Ala Glu Leu Leu 
10 450 455 460 

Thr Gly Arg Thr Leu Phe Pro Gly Thr Asp His He Asp Gin Leu Lys 
465 470 475 480 

Leu He Leu Arg Leu Val Gly Thr Pro Gly Ala GTn Leu Leu Lys Lys 

485 490 495 

15 He Ser Ser Glu Ser Ala Arg Asn Tyr He Gin Ser Leu Thr Gin Met 

500 505 510 

Pro Lys Met Asn Phe Ala Asn Val Phe He Gly Ala Asn Pro Leu Ala 

515 520 525 

Val Asp Leu Leu Glu Lys Met Leu Val Leu Asp Ser Asp Lys Arg He 
20 530 535 540 

Thr Ala Ala Gin Ala Leu Ala His Ala Tyr Phe Ala Gin Tyr His Asp 
545 550 555 560 

Pro Asp Asp Glu Pro Val Ala Asp Pro Tyr Asp Gin Ser Phe Glu Ser 

565 570 575 

Arg Asp Leu Leu He Asp Glu Trp Lys Ser Leu Thr Tyr Asp Glu Val 

580 585 590 

He Ser Phe Val Pro Pro Pro Leu Asp Gin Glu Glu Met Glu Ser 
595 600 605 

30 (2) INFORMATION FOR SEQ ID NO : 4 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2907 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



25 



40 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1 . . .2904 
(D) OTHER INFORMATION: 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 8 : 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4^ 
Met Val Ser Lys Gly Glu Glu Leu Phe Th^ ^ ' 

1 



3 0 



55 GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC 

Glu Gly Glu Gly Asp Ala Thr Tvr f^-V.' ^- :. . 
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35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAG GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 24 0 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 28 B 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

15 CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 33 6 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

20 Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 

115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 43 2 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
25 130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 4 80 

Asn Tyr A.sn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

30 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 

165 170 175 

35 GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 5 76 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 

180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
40 Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 6 72 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
45 210 215 220 



GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC '7?'^ 
Val 

725 



Val Thr Ala Ala Gly lie Thr Leu Glv M'^^ ^-r 



2 5 5 



55 GAT TAT AAA AAG GAA AGA GAA GAA GAT ATT GAC TTG CAC TTG 

Asp Tyr Lys Lys Glu Arc G^m ^ " ; 
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260 265 270 

ATA TTG ACT GTG AAT AAA GGG TCC TTA GTA GCT CTT GGA TTC AGT GAT 864 
lie Leu Thr Val Asn Lys Gly Ser Leu Val Ala Leu Gly Phe Ser Asp 
275 280 285 

GGA CAG GAA GCC AGG CCT GAA GAA ATT GGC TGG TTA AAT GGC TAT AAT 912 
Gly Gin Glu Ala Arg Pro Glu Glu lie Gly Trp Leu Asn Gly Tyr Asn 
290 295 300 

GAA ACC ACA GGG GAA AGG GGG GAC TTT CCG GGA ACT TAC GTA GAA TAT 96 0 

Glu Thr Thr Gly Glu Arg Gly Asp Phe Pro Gly Thr Tyr Val Glu Tyr 
305 310 315 320 

15 ATT GGA AGG AAA AAA ATC TCG CCT CCC ACA CCA AAG CCC CGG CCA CCT 100 8 

lie Gly Arg Lys Lys lie Ser Pro Pro Thr Pro Lys Pro Arg Pro Pro 

325 330 335 

CGG CCT CTT CCT GTT GCA CCA GGT TCT TCG AAA ACT GAA GCA GAT GTT 105 6 

20 Arg Pro Leu Pro Val Ala Pro Gly Ser Ser Lys Thr Glu Ala Asp Val 

340 345 350 

GAA CAA CAA GCT TTG ACT CTC CCG GAT CTT GCA GAG CAG TTT GCC CCT 1104 
Glu Gin Gin Ala Leu Thr Leu Pro Asp Leu Ala Glu Gin Phe Ala Pro 
25 355 360 365 

CCT GAC ATT GCC CCG CCT CTT CTT ATC AAG CTC GTG GAA GCC ATT GAA 1152 

Pro Asp lie Ala Pro Pro Leu Leu lie Lys Leu Val Glu Ala lie Glu 
370 375 380 

30 

AAG AAA GGT CTG GAA TGT TCA ACT CTA TAC AGA ACA CAG AGC TCC AGC 12 0 0 

Lys Lys Gly Leu Glu Cys Ser Thr Leu Tyr Arg Thr Gin Ser Ser Ser 
385 390 395 400 

35 AAC CTG GCA GAA TTA CGA CAG CTT CTT GAT TGT GAT ACA CCC TCC GTG 12 4 8 

Asn Leu Ala Glu Leu Arg Gin Leu Leu Asp Cys Asp Thr Pro Ser Val 

405 410 415 

GAC TTG GAA ATG ATC GAT GTG CAC GTT TTG GCT GAC GCT TTC AAA CGC 12 96 

40 Asp Leu Glu Met lie Asp Val His Val Leu Ala Asp Ala Phe Lys Arg 

420 425 430 

TAT CTC CTG GAC TTA CCA AAT CCT GTC ATT CCA GCA GCC GTT TAC AGT 13 44 

Tyr Leu Leu Asp Leu Pro Asn Pro Val lie Pro Ala Ala Val Tyr Ser 
45 435 440 445 

GAA ATG ATT TCT TTA GCT CCA GAA GTA CAA AGC TCC GAA GAA TAT ATT ^'^9/ 

Glu Met lie Ser Leu Ala Pro Glu Val ^ 

4 c; r> 



/ ^ 
48C 



55 TGG CTT AGG CTT CAG TAT TTG TTA AAA CAT TTC TTC AAG CTC TCT ^A' 

Trp Leu Thr Leu Gin T^/r I ---^ :■ ; - . 
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ACC TCC AGC 
Thr Ser Ser 

5 

AGC CCT ATG 
Ser Pro Met 
515 

10 

AAC CTC ATA 
Asn Leu lie 
530 

15 CGA CAG CCT 
Arg Gin Pro 
545 

GTA GCC AAC 
20 Val Ala Asn 



TGG TAG TGG 
Trp Tyr Trp 

25 

GAT ACA GCA 
Asr> Thr Ala 
595 

30 

CAT GGT GAT 
His Gly Asp 
610 

35 ATC AAA ATA 
lie Lya lie 
625 

ACC TTC ACT 
40 Thr Phe Ser 



CTA GCT CAG 
Leu Ala Gin 

45 

TCC AAA TAG 
Ser Lys Tyr 



485 

AAA AAT CTG TTG AAT 
Lys Asn Leu Leu Asn 
500 

CTT TTC AGA TTC TCA 
Leu Phe Arg Phe Ser 

520 

AAA GTT ATA GAA ATT 
Lys Val He Glu He 

535 

GCA CCA GCA CTG CCT 
Ala Pro Ala Leu Pro 
550 

AAC GGT ATG AAT AAC 
Asn Gly Met Asn Asn 
565 

GGA GAT ATC TCG AGG 
Gly Asp He Ser Arg 
5B0 

GAC GGG ACC TTT TTG 
Asp Gly Thr Phe Leu 

600 

TAT ACT CTT ACA CTA 
Tyr Thr Leu Thr Leu 

615 

TTT CAT CGA GAT GGG 
Phe His Arg Asp Gly 
630 

TCT GTG GTT GAA TTA 
Ser Val Val Glu Leu 
645 

TAT AAT CCC AAA TTG 
Tyr Asn Pro Lys Leu 
660 

CAA CAG GAT CAA GTT 
Gin Gin Asp Gin Val 



94 

490 

GCA AGA GTA CTC TCT 
Ala Arg Val Leu Ser 
505 

GCA GCC AGC TCT GAT 
Ala Ala Ser Ser Asp 

525 

TTA ATC TCA ACT GAA 
Leu He Ser Thr Glu 

540 

CCT AAA CCA CCA AAA 
Pro Lys Pro Pro Lys 
555 

AAT ATG TCC TTA CAA 
Asn Met Ser Leu Gin 
570 

GAA GAA GTG AAT GAA 
Glu Glu Val Asn Glu 
585 

GTA CGA GAT GCG TCT 
Val Arg Asp Ala Ser 

605 

AGG AAA GGG GGA AAT 
Arg Lys Gly Gly Asn 

620 

AAA TAT GGC TTC TCT 
Lys Tyr Gly Phe Ser 
635 

ATA AAC CAC TAC CGG 
He Asn His Tyr Arg 
650 

GAT GTG AAA TTA CTT 
Asp Val Lys Leu Leu 
665 

GTC AAA GAA GAT AAT 

Val "vr - ^ — 



495 

GAA ATT TTC 15 3 6 

Glu He Phe 

510 

AAT ACT GAA 1584 
Asn Thr Glu 



TGG AAT GAA 16 3 2 
Trp Asn Glu 



CCT ACT ACT 16 8 0 
Pro Thr Thr 
560 

AAT GCT GAA 172 8 
Asn Ala Glu 
575 

AAA CTT CGA 1776 

Lys Leu Arg 

590 

ACT AAA ATG 1824 
Thr Lys Met 

AAC AAA TTA 1872 
Asn Lys Leu 

GAC CCA TTA 192 0 

Asp Pro Leu 
640 

AAT GAA TCT 196 8 

Asn Glu Ser 
655 

TAT CCA GTA 2 016 

Tyr Pro Val 

670 

ATT GAA GCT 2 064 



70C 



5^ 



CGA GAA TAT GAT AGA TTA TAT GAA GAA TA^ 
Arg Glu Tyr Asp Arn Le:: ^v- "Iv 



nt 



ACA 



\-. _J 
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705 710 715 720 

ATC CAA ATG AAA AGG ACA GCT ATT GAA GCA TTT AAT GAA ACC ATA AAA 220 8 
lie Gin Met Lys Arg Thr Ala lie Glu Ala Phe Asn Glu Thr lie Lys 

725 730 735 

ATA TTT GAA GAA CAG TGC GAG ACC CAA GAG CGG TAG AGC AAA GAA TAG 225 6 
lie Phe Glu Glu Gin Cys Gin Thr Gin Glu Arg Tyr Ser Lys Glu Tyr 

740 745 750 

ATA GAA AAG TTT AAA CGT GAA GGC AAT GAG AAA GAA ATA CAA AGG ATT 23 04 

lie Glu Lys Phe Lys Arg Glu Gly Asn Glu Lys Glu lie Gin Arg lie 
755 760 765 

15 ATG CAT AAT TAT GAT AAG TTG AAG TCT CGA ATC AGT GAA ATT ATT GAC 23 52 
Met His Asn Tyr Asp Lys Leu Lys Ser Arg lie Ser Glu lie lie Asp 
770 775 780 

AGT AG A AG A AG A TTG GAA GAA GAC TTG AAG AAG CAG GCA GCT GAG TAT 24 0 0 

20 Ser Arg Arg Arg Leu Glu Glu Asp Leu Lys Lys Gin Ala Ala Glu Tyr 
7B5 790 795 800 

CGA GAA ATT GAC AAA CGT ATG AAC AGC ATT AAA CCA GAC CTT ATC CAG 244 8 

Arg Glu lie Asp Lys Arg Met Asn Ser lie Lys Pro Asp Leu lie Gin 
25 805 810 815 

CTG AGA AAG ACG AGA GAC CAA TAG TTG ATG TGG TTG ACT CAA AAA GGT 2 4 96 

Leu Arg Lys Thr Arg Asp Gin Tyr Leu Met Trp Leu Thr Gin Lys Gly 

820 825 830 

30 

GTT CGG CAA AAG AAG TTG AAC GAG TGG TTG GGC AAT GAA AAC ACT GAA 2 54 4 

Val Arg Gin Lys Lys Leu Asn Glu Trp Leu Gly Asn Glu Asn Thr Glu 
835 840 845 

35 GAC CAA TAT TCA CTG GTG GAA GAT GAT GAA GAT TTG CCC CAT CAT GAT 2 5 92 

Asp Gin Tyr Ser Leu Val Glu Asp Asp Glu Asp Leu Pro His His Asp 
850 855 860 

GAG AAG ACA TGG AAT GTT GGA AGC AGC AAC CGA AAC AAA GCT GAA AAC 2 64 0 

40 Glu Lys Thr Trp Asn Val Gly Ser Ser Asn Arg Asn Lys Ala Glu Asn 
865 870 875 880 

CTG TTG CGA GGG AAG CGA GAT GGC ACT TTT CTT GTC CGG GAG AGC AGT 2 68 8 
Leu Leu Arg Gly Lys Arg Asp Gly Thr Phe Leu Val Arg Glu Ser Ser 
45 885 890 895 

AAA CAG GGC TGC TAT GCC TGC TCT GTA GTG GTG GAC GGC GAA GTA AAG 2'^3f 
Lys Gin Gly Cys Tyr Ala Cys Ser Val Vp^. ^" 



j2C 925 

55 TAT AAC TTG TAG AGC TCT CTG hJ^Ji GAA CTG GTG CTA CAT TAG 

Tyr Asn Leu l>/r Ser Ser :.e\: :.-.'r- . "" ^ : 
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25 



30 



35 



40 



45 



96 

930 935 940 

ACC TCC CTT GTG CAG CAC AAC GAC TCC CTC AAT GTC ACA CTA GCC TAC 2 8 80 

Thr Ser Leu Val Gin His Asn Asp Ser Leu Asn Val Thr Leu Ala Tyr 
945 950 955 960 

CCA GTA TAT GCA CAG CAG AGG CGA TGA 2 90 7 

Pro Val Tyr Ala Gin Gin Arg Arg 

965 



(2) INFORMATION FOR SEQ ID NO: 49 



( -i ) SEQUENCE CKAKACTERISTICS : 
15 (A) LENGTH: 968 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49 



Met 


Val 


Ser 


Lys 


Gly 


Glu 


Glu 


Leu 


Phe 


Thr 


Gly Val 


Val 


Pro 


He 


Leu 


1 








5 










10 










15 




Val 


Glu 


Leu 


Asp 


Gly 


Asp 


Val 


Asn 


Gly 


His 


Lys 


Phe 


Ser 


Val 


Ser 


Gly 


















25 










30 






Glu 


Gly 


Glu 


Gly Asp 


Ala 


Thr 


Tyr 


Gly 


Lys 


Leu 


Thr 


Leu 


Lys 


Phe 


He 






35 










40 










45 








Cys 


Thr 


Thr 


Gly 


Lys 


Leu 


Pro 


Val 


Pro 


Trp 


Pro 


Thr 


Leu 


Val 


Thr 


Thr 




50 










55 










60 










Leu 


Thr 


Tyr 


Gly Val 


Gin 


Cys 


Phe 


Ser 


Arg 


Tyr 


Pro 


Asp 


His 


Met 


Lys 


65 










70 










75 










80 


Gin 


His 


Asp 


Phe 


Phe 


Lys 


Ser 


Ala 


Met 


Pro 


Glu 


Gly 


Tyr 


Val 


Gin 


Glu 










85 










90 










95 




Arg 


Thr 


He 


Phe 


Phe 


Lys 


Asp 


Asp 


Gly 


Asn 


Tyr 


Lys 


Thr 


Arg 


Ala 


Glu 








100 










105 










110 






Val 


Lys 


Phe 


Glu 


Gly 


Asp 


Thr 


Leu 


Val 


Asn 


Arg 


He 


Glu 


Leu 


Lys 


Gly 






115 










120 










125 








lie 


Asp 


Phe 


Lys 


Glu 


Asp 


Gly Asn 


He 


Leu 


Gly 


His 


Lys 


Leu 


Glu 


Tyr 




130 










135 










140 










Asn 


Tyr 


Asn 


Ser 


His 


Asn 


Val 


Tyr 


He 


Met 


Ala 


Asp 


Lys 


Gin 


Lys 


Asn 


145 










150 










155 










160 


Gly 


lie 


Lys 


Val 


Asn 


Phe 


Lys 


He 


Arg 


His 


Asn 


He 


Glu 


Asp 


Gly 


Ser 










165 










170 










175 




Val 


Gin 


Leu 


Ala 


Asp 


His 


Tyr 


Gin 


Gin 


Asn 


Thr 


Pro 


He 


Gly 


Asp 


Gly 








180 










185 




















^ J— > ■ ■ 





























. .- . ,. : :A'\_i . y "o^ Asp Glu Leu Tyr Lys Ser 

^^25 230 235 240 

55 Gly Leu Arg Ser Met Ser Ala Glu Gly Tyr Gin Tyr Arg Ala Leu T'.— 

2 4 5 : ' 
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Asp Tyr Lys Lys Glu Arg Glu Glu Asp lie Asp Leu His Leu Gly Asp 

260 265 270 

lie Leu Thr Val Asn Lys Gly Ser Leu Val Ala Leu Gly Phe Ser Asp 
275 280 285 

5 Gly Gin Glu Ala Arg Pro Glu Glu lie Gly Trp Leu Asn Gly Tyr Asn 
290 295 300 

Glu Thr Thr Gly Glu Arg Gly Asp Phe Pro Gly Thr Tyr Val Glu Tyr 
305 310 315 320 

lie Gly Arg Lys Lys lie Ser Pro Pro Thr Pro Lys Pro Arg Pro Pro 
10 325 330 335 

Arg Pro Leu Pro Val Ala Pro Gly Ser Ser Lys Thr Glu Ala Asp Val 

340 345 350 

Glu Gin Gin Ala Leu Thr Leu Pro Asp Leu Ala Glu Gin Phe Ala Pro 
355 360 365 

15 Pro Asp lie Ala Pro Pro Leu Leu lie Lys Leu Val Glu Ala lie Glu 
370 375 380 

Lys Lys Gly Leu Glu Cys Ser Thr Leu Tyr Arg Thr Gin Ser Ser Ser 
385 390 395 400 

Asn Leu Ala Glu Leu Arg Gin Leu Leu Asp Cys Asp Thr Pro Ser Val 
20 405 410 415 

Asp Leu Glu Met He Asp Val His Val Leu Ala Asp Ala Phe Lys Arg 

420 425 430 

Tyr Leu Leu Asp Leu Pro Asn Pro Val He Pro Ala Ala Val Tyr Ser 
435 440 445 

25 Glu Met He Ser Leu Ala Pro Glu Val Gin Ser Ser Glu Glu Tyr He 
450 455 460 

Gin Leu Leu Lys Lys Leu He Arg Ser Pro Ser Tie Pro His Gin Tyr 
465 470 475 480 

Trp Leu Thr Leu Gin Tyr Leu Leu Lys His Phe Phe Lys Leu Ser Gin 
30 485 490 495 

Thr Ser Ser Lys Asn Leu Leu Asn Ala Arg Val Leu Ser Glu He Phe 

500 505 510 

Ser Pro Met Leu Phe Arg Phe Ser Ala Ala Ser Ser Asp Asn Thr Glu 
515 520 525 

35 Asn Leu He Lys Val He Glu He Leu He Ser Thr Glu Trp Asn Glu 
530 535 540 

Arg Gin Pro Ala Pro Ala Leu Pro Pro Lys Pro Pro Lys Pro Thr Thr 
545 550 555 560 

Val Ala Asn Asn Gly Met Asn Asn Asn Met Ser Leu Gin Asn Ala Glu 
40 565 570 575 

Trp Tyr Trp Gly Asp He Ser Arg Glu Glu Val Asn Glu Lys Leu Arg 

580 585 590 

Asp Thr Ala Asp Gly Thr Phe Leu Val Arg Asp Ala Ser Thr Lys Met 
595 600 605 

45 His Gly Asp Tyr Thr Leu Thr Leu Arg Lys Gly Gly Asn Asn Lys Leu 
610 615 620 

He Lys He Phe His Arg Asp Gly Lys Tyr Gly Phe Ser Asp Pro Leu 
625 630 635 

Thr Phe Ser Ser "i ' " 

r r- 



55 



- : I J. 

6B0 



685 



'al Gly Ly!J Lys Leu His Glu Tyr Asn Thr Gin Phe Gin Glu Lys S^r 
690 695 
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Arg Glu Tyr Asp Arg Leu Tyr Glu Glu Tyr Thr Arg Thr Ser Gin Glu 
705 710 715 720 

lie Gin Met Lys Arg Thr Ala lie Glu Ala Phe Asn Glu Thr He Lys 

725 730 735 

5 He Phe Glu Glu Gin Cys Gin Thr Gin Glu Arg Tyr Ser Lys Glu Tyr 

740 745 750 

He Glu Lys Phe Lys Arg Glu Gly Asn Glu Lys Glu He Gin Arg He 

755 760 765 

Met His Asn Tyr Asp Lys Leu Lys Ser Arg He Ser Glu He He Asp 
10 770 775 7B0 

Ser Arg Arg Arg Leu Glu Glu Asp Leu Lys Lys Gin Ala Ala Glu Tyr 
785 790 795 800 

Arg Glu He Asp Lys Arg Met Asn Ser He Lys Pro Asp Leu He Gin 

805 810 815 

15 Leu Arg Lys Thr Arg Asp Gin Tyr Leu Met Trp Leu Thr Gin Lys Gly 

820 825 830 

Val Arg Gin Lys Lys Leu Asn Glu Trp Leu Gly Asn Glu Asn Thr Glu 

835 840 845 

Asp Gin Tyr Ser Leu Val Glu Asp Asp Glu Asp Leu Pro His His Asp 
20 850 855 860 

Glu Lys Thr Trp Asn Val Gly Ser Ser Asn Arg Asn Lys Ala Glu Asn 
865 870 875 880 

Leu Leu Arg Gly Lys Arg Asp Gly Thr Phe Leu Val Arg Glu Ser Ser 

885 890 895 

25 Lys Gin Gly Cys Tyr Ala Cys Ser Val Val Val Asp Gly Glu Val Lys 

900 905 910 

His Cys Val He Asn Lys Thr Ala Thr Gly Tyr Gly Phe Ala Glu Pro 

915 920 925 

Tyr Asn Leu Tyr Ser Ser Leu Lys Glu Leu Val Leu His Tyr Gin His 
30 930 935 940 

Thr Ser Leu Val Gin His Asn Asp Ser Leu Asn Val Thr Leu Ala Tyr 
945 950 955 960 

Pro Val Tyr Ala Gin Gin Arg Arg 

965 



35 



(2) INFORMATION FOR SEQ ID NO: 50: 



(l) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2160 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
45 (ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: I . . , 2 1 =^ 



Met Vai Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
55 1 5 ic 1- 
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GTC GAG CTG GAG GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

5 GAG GGC GAG GGC GAT GCC ACC TAG GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
10 Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAG GGC GTG CAG TGC TTC AHC CGC TAC CCC GAC CAC ATG AAG 24 0 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
15 65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 288 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

20 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 3 36 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

25 GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 43 2 

30 lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 48 0 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
35 145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

40 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 57 6 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

45 CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 62 4 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 



- ■-' v.. O..,^ v^j'^T^l >A . v_ /Av_ i v., ^ _ oU-'v- i U WM^, v,Jrt\J ^ M.L AJALt iLv. I ^ 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
55 225 23C 235 
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30 



100 

GGA CTC AGA TCT CGA GCT CAA GCT TCG AAT TCG ACC ATG TCG TCC ATC 76 8 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Thr Met Ser Ser lie 

245 250 255 

TTG CCA TTC ACG CCG CCA GTT GTG AAG AGA CTG CTG GGA TGG AAG AAG 816 

Leu Pro Phe Thr Pro Pro Val Val Lys Arg Leu Leu Gly Trp Lys Lys 

260 265 270 

TCA GCT GGT GGG TCT GGA GGA GCA GGC GGA GGA GAG CAG AAT GGG CAG 864 

Ser Ala Gly Gly Ser Gly Gly Ala Gly Gly Gly Glu Gin Asn Gly Gin 
275 280 285 



GAA GAA AAG TGG TGT GAG AAA GCA GTG A?J^. AGT CTG UTG AAG AAG CTA 912 

Glu Glu Lys Trp Cys Glu Lys Ala Val Lys Ser Leu Val Lys Lys Leu 
15 290 295 300 

AAG AAA ACA GGA CGA TTA GAT GAG CTT GAG AAA GCC ATC ACC ACT CAA 960 

Lys Lys Thr Gly Arg Leu Asp Glu Leu Glu Lys Ala lie Thr Thr Gin 
305 310 315 320 

20 

AAC TGT AAT ACT AAA TGT GTT ACC ATA CCA AGC ACT TGC TCT GAA ATT 10 0 8 

Asn Cys Asn Thr Lys Cys Val Thr He Pro Ser Thr Cys Ser Glu He 

325 330 335 

25 TGG GGA CTG AGT ACA CCA AAT ACG ATA GAT CAG TGG GAT ACA ACA GGC 1056 

Trp Gly Leu Ser Thr Pro Asn Thr He Asp Gin Trp Asp Thr Thr Gly 

340 345 350 



CTT TAG AGC TTC TCT GAA CAA ACC AGG TCT CTT GAT GGT CGT CTC CAG 1104 
Leu Tyr Ser Phe Ser Glu Gin Thr Arg Ser Leu Asp Gly Arg Leu Gin 
355 360 365 



GTA TCC CAT CGA AAA GGA TTG CCA CAT GTT ATA TAT TGC CGA TTA TGG 1152 

Val Ser His Arg Lys Gly Leu Pro His Val lie Tyr Cys Arg Leu Trp 

35 370 375 380 

CGC TGG CCT GAT CTT CAC AGT CAT CAT GAA CTC AAG GCA ATT GAA AAC 12 0 0 

Arg Trp Pro Asp Leu His Ser His His Glu Leu Lys Ala He Glu Asn 

385 390 395 400 

40 

TGC GAA TAT GCT TTT AAT CTT AAA AAG GAT GAA GTA TGT GTA AAC CCT 124 8 

Cys Glu Tyr Ala Phe Asn Leu Lys Lys Asp Glu Val Cys Val Asn Pro 

405 410 415 

45 TAG CAC TAT CAG AGA GTT GAG ACA CCA GTT TTG CCT CCA GTA TTA GTG 12 9 6 

Tyr His Tyr Gin Arg Val Glu Thr Pro Val Leu Pro Pro Val Leu Val 

420 425 430 



.... ...... .... A.^C ACT A^\C TTC CCA GCA GGA ATT GAG CCA 13 92 

Thr Hie Ser lie Pro Glu Asn Thr Asn Fhe Pro Ala Gly He Glu Pro 
55 450 455 460 
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CAG AGT AAT TAT ATT CCA GAA ACG CCA CCT CCT GGA TAT ATC AGT GAA 144 0 

Gin Ser Asn Tyr He Pro Glu Thr Pro Pro Pro Gly Tyr He Ser Glu 
465 470 475 480 

5 GAT GGA GAA ACA AGT GAC CAA CAG TTG AAT CAA AGT ATG GAC ACA GGC 14 8 8 

Asp Gly Glu Thr Ser Asp Gin Gin Leu Asn Gin Ser Met Asp Thr Gly 

485 490 495 

TCT CCA GCA GAA CTA TCT CCT ACT ACT CTT TCC CCT GTT AAT CAT AGC 15 3 6 

10 Ser Pro Ala Glu Leu Ser Pro Thr Thr Leu Ser Pro Val Asn His Ser 

500 505 510 

TTG GAT TTA CAG CCA GTT ACT TAG TCA GA^ CCT GCA TTT TGG TGT TCA 15 84 

Leii Asp Lgu Gin Pro Val Thr Tyr Ser Glu Pro Ala Phe Trp Cys Ser 
15 515 520 525 

ATA GCA TAT TAT GAA TTA AAT CAG AGG GTT GGA GAA ACC TTC CAT GCA 163 2 

He Ala Tyr Tyr Glu Leu Asn Gin Arg Val Gly Glu Thr Phe His Ala 
530 535 540 

20 

TCA CAG CCC TCA CTC ACT GTA GAT GGC TTT ACA GAC CCA TCA AAT TCA 16 8 0 

Ser Gin Pro Ser Leu Thr Val Asp Gly Phe Thr Asp Pro Ser Asn Ser 
545 550 555 560 

25 GAG AGG TTC TGC TTA GGT TTA CTC TCC AAT GTT AAC CGA AAT GCC ACG 172 8 

Glu Arg Phe Cys Leu Gly Leu Leu Ser Asn Val Asn Arg Asn Ala Thr 

565 570 575 

GTA GAA ATG ACA AGA AGG CAT ATA GGA AGA GGA GTG CGC TTA TAG TAC 17 7 6 

30 Val Glu Met Thr Arg Arg His He Gly Arg Gly Val Arg Leu Tyr Tyr 

580 585 590 

ATA GGT GGG GAA GTT TTT GCT GAG TGC CTA AGT GAT AGT GCA ATC TTT 18 24 

He Gly Gly Glu Val Phe Ala Glu Cys Leu Ser Asp Ser Ala He Phe 
35 595 600 605 

GTG CAG AGC CCC AAT TGT AAT CAG AGA TAT GGC TGG CAC CCT GCA ACA 18 72 

Val Gin Ser Pro Asn Cys Asn Gin Arg Tyr Gly Trp His Pro Ala Thr 
610 615 520 

40 

GTG TGT AAA ATT CCA CCA GGC TGT AAT CTG AAG ATC TTC AAC AAC CAG 192 0 

Val Cys Lys He Pro Pro Gly Cys Asn Leu Lys He Phe Asn Asn Gin 
625 630 635 640 

45 GAA TTT GCT GCT CTT CTG GCT CAG TCT GTT AAT CAG GGT TTT GAA GCC 196 8 

Glu Phe Ala Ala Leu Leu Ala Gin Ser Val Asn Gin Gly Phe Glu Ala 

645 650 655 



.s ..J ..s V _: A V ^ jr\ ^ ^ ^v_r _ j-VV: M'^ Lj Lj i M ^^/\ /\w i Awj. XO^ ^ U t> ^ 

Gly Trp Gly Ala Glu Tyr Arg Arg Gin Thr Val Thr Ser Thr Pro Cys 
55 675 680 685 
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TGG ATT GAA CTT CAT CTG AAT GGA CCT CTA GAG TGG TTG GAG AAA GTA 2112 

Trp lie Glu Leu His Leu Asn Gly Pro Leu Gin Trp Leu Asp Lys Val 

690 595 700 

TTA ACT CAG ATG GGA TCC CCT TCA GTG CGT TGC TCA AGC ATG TCA TAA 216 0 

Leu Thr Gin Met Gly Ser Pro Ser Val Arg Cys Ser Ser Met Ser 

705 710 715 



10 (2) INFORMATION FOR SEQ ID NO: 51 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 719 amino acidf? 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 



Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

25 Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
30 50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

35 Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 

115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
40 130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 ISO 155 160 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 

165 170 175 

45 Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 



. ■ ^ ■:: A> A.ci A..a :Jcr Asn Ser Tnr Ket Ser Ser I^e 

245 250 255 

55 Leu Pro Phe Thr Pro Pro Val Val Lys Arg Leu Leu Glv Trr T/.t 

2 6 C : - 
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Ser Ala Gly Gly Ser Gly Gly Ala Gly Gly Gly Glu Gin Asn Gly Gin 
275 280 285 

Glu Glu Lys Trp Cys Glu Lys Ala Val Lys Ser Leu Val Lys Lys Leu 
290 295 300 

5 Lys Lys Thr Gly Arg Leu Asp Glu Leu Glu Lys Ala lie Thr Thr Gin 
305 310 315 320 

Asn Cys Asn Thr Lys Cys Val Thr He Pro Ser Thr Cys Ser Glu He 

325 330 335 

Trp Gly Leu Ser Thr Pro Asn Thr He Asp Gin Trp Asp Thr Thr Gly 
10 340 345 350 

Leu Tyr Ser Phe Ser Glu Gin Thr Arg Ser Leu Asp Gly Arg Leu Gin 

355 360 365 

Val Ser His Arg Lys Gly Leu Pro His Val He Tyr Cys Arg Leu Trp 
•^^o 375 380 

15 Arg Trp Pro Asp Leu His Ser His His Glu Leu Lys Ala He Glu Asn 
385 390 395 400 

Cys Glu Tyr Ala Phe Asn Leu Lys Lys Asp Glu Val Cys Val Asn Pro 

405 410 415 

Tyr His Tyr Gin Arg Val Glu Thr Pro Val Leu Pro Pro Val Leu Val 
20 420 425 430 

Pro Arg His Thr Glu He Leu Thr Glu Leu Pro Pro Leu Asp Asp Tyr 

435 440 445 

Thr His Ser He Pro Glu Asn Thr Asn Phe Pro Ala Gly He Glu Pro 
450 455 460 

25 Gin Ser Asn Tyr He Pro Glu Thr Pro Pro Pro Gly Tyr He Ser Glu 
465 470 475 480 

Asp Gly Glu Thr Ser Asp Gin Gin Leu Asn Gin Ser Met Asp Thr Gly 

485 490 495 

Ser Pro Ala Glu Leu Ser Pro Thr Thr Leu Ser Pro Val Asn His Ser 
30 500 505 510 

Leu Asp Leu Gin Pro Val Thr Tyr Ser Glu Pro Ala Phe Trp Cys Ser 

515 520 525 

He Ala Tyr Tyr Glu Leu Asn Gin Arg Val Gly Glu Thr Phe His Ala 
530 535 540 

35 Ser Gin Pro Ser Leu Thr Val Asp Gly Phe Thr Asp Pro Ser Asn Ser 
545 550 555 560 

Glu Arg Phe Cys Leu Gly Leu Leu Ser Asn Val Asn Arg Asn Ala Thr 

565 570 575 

Val Glu Met Thr Arg Arg His He Gly Arg Gly Val Arg Leu Tyr Tyr 
40 580 585 590 

He Gly Gly Glu Val Phe Ala Glu Cys Leu Ser Asp Ser Ala He Phe 

595 600 605 

Val Gin Ser Pro Asn Cys Asn Gin Arg Tyr Gly Trp His Pro Ala Thr 
610 615 620 

45 Val Cys Lys He Pro Pro Gly Cys Asn Leu Lys He Phe Asn Asn Gin 
625 630 635 640 

Glu Phe Ala Ala Leu Leu Ala Gin Ser Val Asn Gin Gly Phe Glu Ala 

645 



re 



u Gin Trp Leu Asp Lys Val 



690 695 700 

55 Leu Thr Gin Met Gly Ser Fro Ser Val Arg Cys Ser Ser Me*: 
7 0 5 : ^ 
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(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 2421 base pairs 

{B} TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: cDNA 

( ix ) FEATURE : 

(A) NAME/KEY: Coding Sequence 
{B) LOCATION: 1...2418 
15 (D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

20 Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
25 20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

30 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

35 CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 24 0 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 86 

40 Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
45 100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ~ - ' 

Val Lys Phe Glu Gly Asr Thr ' ^.-^ " -> ^ ' ■ 



■ ■ . . . .■ . T . ^ :i ^, ..jV s Leu 1 u Tyr 

^ 13 5 14 0 

55 AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG GAG ;-7-.'- 

Asn Tyr Asn Pr^r Hi r A^-- .■ ■ ' ••■ • ■ 
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105 

145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGO 52 8 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 

165 170 175 

GTG CAG CTC GCC GAC CAC TAG CAG CAG AAC ACC CCC ATC GGC GAC GGC 5 76 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 

180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 6 24 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 700 205 

15 AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Mec Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 72 0 

20 Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCT CAA GCT TCG AAT TCG AAT TCA ACC ATG GAC 768 
Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Asn Ser Thr Met Asp 
25 245 250 255 

AAT ATG TCT ATT ACG AAT ACA CCA ACA AGT AAT GAT GCC TGT CTG AGC 816 

Asn Met Ser lie Thr Asn Thr Pro Thr Ser Asn Asp Ala Cys Leu Ser 

260 265 270 

30 

ATT GTG CAT AGT TTG ATG TGC CAT AGA CAA GGT GGA GAG AGT GAA ACA 864 

lie Val His Ser Leu Met Cys His Arg Gin Gly Gly Glu Ser Glu Thr 
275 280 285 

35 TTT GCA AAA AGA GCA ATT GAA AGT TTG GTA AAG AAG CTG AAG GAG AAA 912 
Phe Ala Lys Arg Ala lie Glu Ser Leu Val Lys Lys Leu Lys Glu Lys 
290 295 300 

AAA GAT GAA TTG GAT TCT TTA ATA ACA GCT ATA ACT ACA AAT GGA GCT 96 0 

40 Lys Asp Glu Leu Asp Ser Leu lie Thr Ala lie Thr Thr Asn Gly Ala 
305 310 315 320 

CAT CCT AGT AAA TGT GTT ACC ATA CAG AGA ACA TTG GAT GGG AGG CTT 100 8 

His Pro Ser Lys Cys Val Thr lie Gin Arg Thr Leu Asp Gly Arg Leu 
45 325 330 335 

CAG GTG GCT GGT CGG AAA GGA TTT CCT CAT GTG ATC TAT GCC CGT CT^ - n r r 

Gin Val Ala Gly Ara rv.. 



. _ .. / ^ ; . : . . - . . f ■ .... ^^'j' ills ,Ci^ 

36C 365 
55 TGT CAG TAT GCG TTT GAC TTA AAA TGT GAT AGT GTC T^T ^-'7'^ 
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370 375 380 

TAT CAC TAC GAA CGA GTT GTA TCA CCT GGA ATT GAT CTC TCA GGA TTA 12 0 0 

Tyr His Tyr Glu Arg Val Val Ser Pro Gly lie Asp Leu Ser Gly Leu 
385 390 395 400 

ACA CTG CAG AGT AAT GCT CCA TCA AGT ATG ATG GTG AAG GAT GAA TAT 124 8 

Thr Leu Gin Ser Asn Ala Pro Ser Ser Met Met Val Lys Asp Glu Tyr 

405 410 415 

GTG CAT GAG TTT GAG GGA CAG CCA TCG TTG TCC ACT GAA GGA CAT TCA 12 96 

Val His Asp Phe Glu Gly Gin Pro Ser Leu Ser Thr Glu Gly His Ser 

420 430 

15 ATT CAA ACC ATC CAG CAT CCA CCA AGT AAT CGT GCA TCG ACA GAG ACA 1344 
lie Gin Thr lie Gin His Pro Pro Ser Asn Arg Ala Ser Thr Glu Thr 
435 440 445 



10 



20 



40 



TAC AGC ACC CCA GCT CTG TTA GCC CCA TCT GAG TCT AAT GCT ACC AGC 13 92 

Tyr Ser Thr Pro Ala Leu Leu Ala Pro Ser Glu Ser Asn Ala Thr Ser 
450 455 460 



ACT GCC AAC TTT CCC AAC ATT CCT GTG GCT TCC ACA AGT CAG CCT GCC 1440 
Thr Ala Asn Phe Pro Asn He Pro Val Ala Ser Thr Ser Gin Pro Ala 
25 465 470 475 480 

AGT ATA CTG GGG GGC AGC CAT AGT GA.A GGA CTG TTG CAG ATA GCA TCA 14 88 

Ser He Leu Gly Gly Ser His Ser Glu Gly Leu Leu Gin He Ala Ser 

485 490 495 

30 

GGG CCT CAG CCA GGA CAG CAG CAG AAT GGA TTT ACT GGT CAG CCA GCT 153 6 

Gly Pro Gin Pro Gly Gin Gin Gin Asn Gly Phe Thr Gly Gin Pro Ala 

500 505 510 

35 ACT TAC CAT CAT AAC AGC ACT ACC ACC TGG ACT GGA AGT AGG ACT GCA 15 84 
Thr Tyr His His Asn Ser Thr Thr Thr Trp Thr Gly Ser Arg Thr Ala 
515 520 525 



CCA TAC ACA CCT AAT TTG CCT CAC CAC CAA AAC GGC CAT CTT CAG CAC 163 2 

Pro Tyr Thr Pro Asn Leu Pro His His Gin Asn Gly His Leu Gin His 
530 535 540 



CAC CCG CCT ATG CCG CCC CAT CCC GGA CAT TAC TGG CCT GTT CAC AAT 16 8 0 

His Pro Pro Met Pro Pro His Pro Gly His Tyr Trp Pro Val His Asn 
45 545 550 555 560 

GAG CTT GCA TTC CAG CCT CCC ATT TCC AAT CAT CCT GCT CCT GAG ^TAT 
Glu Leu Ala Phe Gin Pro Pr'- "lo 



- 5 S 5: 5 9 0 

55 TTT AAG GTT CCT TCA AGC TGC CCT ATT GTT ACT GTT GAT "--^ 

Phe Lvs Val Pro Ser ^- ' ' ■ 
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595 600 605 

GAC CCT TCT GGA GGA GAT CGC TTT TGT TTG GGT CAA CTC TCC AAT GTC 1872 
Asp Pro Ser Gly Gly Asp Arg Phe Cys Leu Gly Gin Leu Ser Asn Val 
610 615 620 

CAC AGG ACA GAA GCC ATT GAG AGA GCA AGG TTG CAC ATA GGC AAA GGT 192 0 
His Arg Thr Glu Ala He Glu Arg Ala Arg Leu His He Gly Lys Gly 
625 630 635 640 

GTG CAG TTG GAA TGT AAA GGT GAA GGT GAT GTT TGG GTC AGG TGC CTT 196 8 

Val Gin Leu Glu Cys Lys Gly Glu Gly Asp Val Trp Val Arg Cys Leu 

645 650 655 

15 AGT GAC CAC GCG GTC TTT GTA CAG AGT TAC TAC TTA GAC AGA GAA GCT 2016 
Ser Asp His Ala Val Phe Val Gin Ser Tyr Tyr Leu Asp Arg Glu Ala 

660 665 670 



10 



20 



40 



55 



GGG CGT GCA CCT GGA GAT GCT GTT CAT AAG ATC TAC CCA AGT GCA TAT 2 064 

Gly Arg Ala Pro Gly Asp Ala Val His Lys He Tyr Pro Ser Ala Tyr 
675 680 685 



ATA AAG GTC TTT GAT TTG CGT CAG TGT CAT CGA CAG ATG CAG CAG CAG 2112 
He Lys Val Phe Asp Leu Arg Gin Cys His Arg Gin Met Gin Gin Gin 
25 690 695 700 

GCG GCT ACT GCA CAA GCT GCA GCA GCT GCC CAG GCA GCA GCC GTG GCA 216 0 

Ala Ala Thr Ala Gin Ala Ala Ala Ala Ala Gin Ala Ala Ala Val Ala 
705 710 715 720 

30 

GGA AAC ATC CCT GGC CCA GGA TCA GTA GGT GGA ATA GCT CCA GCT ATC 220 8 

Gly Asn He Pro Gly Pro Gly Ser Val Gly Gly He Ala Pro Ala He 

725 730 735 

35 AGT CTG TCA GCT GCT GCT GGA ATT GGT GTT GAT GAC CTT CGT CGC TTA 22 5 6 
Ser Leu Ser Ala Ala Ala Gly He Gly Val Asp Asp Leu Arg Arg Leu 

740 745 750 



TGC ATA CTC AGG ATG AGT TTT GTG AAA GGC TGG GGA CCG GAT TAC CCA 23 04 

Cys He Leu Arg Met Ser Phe Val Lys Gly Trp Gly Pro Asp Tyr Pro 
755 760 765 



AGA CAG AGC ATC AAA GAA ACA CCT TGC TGG ATT GAA ATT CAC TTA CAC 23 5 2 

Arg Gin Ser He Lys Glu Thr Pro Cys Trp He Glu He His Leu His 

45 770 775 780 

CGG GCC CTC CAG CTC CTA GAC GAA GTA CTT CAT ACC ATG CCG ATT r.r^T- ^ i ^ 

Arg Ala Leu Gin JjeM ^. 



8C 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: B06 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 



Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
t 5 10 15 

15 Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
20 50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 60 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

25 Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
30 130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

35 Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 2C0 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
40 210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Asn Ser Thr Met Asp 

245 250 255 

45 Asn Met Ser He Thr Asn Thr Pro Thr Ser Asn Asp Ala Cys Leu Ser 

260 265 270 

He Val His Ser Leu Met Cys His Arg Gin Gly Gly Glu Ser Glu Thr 
2 7 5 2 B n 



3 2 3 



lie Gin Arg Tnr Leu Asp Gly Arg Leu 
325 330 335 

55 Gin Val Ala Gly Arg Lys Gly Fho Pro Hig Val He Tvr Ala Ar-^ 

3 4 0 
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Trp Arg Trp Pro Asp Leu His Lys Asn Glu Leu Lys His Val Lys Tyr 

355 360 365 

Cys Gin Tyr Ala Phe Asp Leu Lys Cys Asp Ser Val Cys Val Asn Pro 
370 375 380 

5 Tyr His Tyr Glu Arg Val Val Ser Pro Gly lie Asp Leu Ser Gly Leu 
385 390 395 400 

Thr Leu Gin Ser Asn Ala Pro Ser Ser Met Met Val Lys Asp Glu Tyr 

405 410 415 

Val His Asp Phe Glu Gly Gin Pro Ser Leu Ser Thr Glu Gly His Ser 
10 420 425 430 

lie Gin Thr lie Gin His Pro Pro Ser Asn Arg Ala Ser Thr Glu Thr 

435 440 445 

Tyr Ser Thr Pro Ala Leu Len Ala Pro Ser Glu Ser Asn Ala Thr Ser 
450 455 460 

15 Thr Ala Asn Phe Pro Asn lie Pro Val Ala Ser Thr Ser Gin Pro Ala 
465 470 475 4B0 

Ser lie Leu Gly Gly Ser His Ser Glu Gly Leu Leu Gin lie Ala Ser 

485 490 495 

Gly Pro Gin Pro Gly Gin Gin Gin Asn Gly Phe Thr Gly Gin Pro Ala 
20 500 505 510 

Thr Tyr His His Asn Ser Thr Thr Thr Trp Thr Gly Ser Arg Thr Ala 

515 520 525 

Pro Tyr Thr Pro Asn Leu Pro His His Gin Asn Gly His Leu Gin His 
530 535 540 

25 His Pro Pro Met Pro Pro His Pro Gly His Tyr Trp Pro Val His Asn 
545 550 555 560 

Glu Leu Ala Phe Gin Pro Pro Ila Ser Asn His Pro Ala Pro Glu Tyr 

565 570 575 

Trp Cys Ser He Ala Tyr Phe Glu Met Asp Val Gin Val Gly Glu Thr 
30 580 585 590 

Phe Lys Val Pro Ser Ser Cys Pro He Val Thr Val Asp Gly Tyr Val 

595 600 605 

Asp Pro Ser Gly Gly Asp Arg Phe Cys Leu Gly Gin Leu Ser Asn Val 
610 615 620 

35 His Arg Thr Glu Ala He Glu Arg Ala Arg Leu His He Gly Lys Gly 
625 630 635 640 

Val Gin Leu Glu Cys Lys Gly Glu Gly Asp Val Trp Val Arg Cys Leu 

645 650 655 

Ser Asp His Ala Val Phe Val Gin Ser Tyr Tyr Leu Asp Arg Glu Ala 
40 660 665 670 

Gly Arg Ala Pro Gly Asp Ala Val His Lys He Tyr Pro Ser Ala Tyr 

675 680 685 

He Lys Val Phe Asp Leu Arg Gin Cys His Arg Gin Met Gin Gin Gin 
690 695 700 

45 Ala Ala Thr Ala Gin Ala Ala Ala Ala Ala Gin Ala Ala Ala Val Ala 
705 710 715 720 

Gly Asn He Pro Gly Pro Gly Ser Val Gly Gly He Ala Pro Ala Tl- 

72^ 



55 



■'65 

o..ri Ser i.e Lys G.Lu 7hr Pro Cys Trp lie Glu He His Leu His 

"^70 77 5 780 

Arg Ala Leu Gin Leu Leu Asp Glu Val Leu Hi.':: Thr ■■'^^ r--- - ■, 
7 p =^ - 
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Asp Pro Gin Pro Leu Asp 

805 

(2) INFORMATION FOR SEQ ID NO: 54 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
( ix ) FEATURE : 

15 (A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .3117 
{D) OTHER INFORMATION: 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 

25 GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
30 Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
35 50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 24 0 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

65 70 75 80 

40 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

45 CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 33 6 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

GTG AAf^- TTr ^.r.^, ■ • 



^ -J K^-^-^ 'v, ^ 'v-J Lt/Yo 4 J 



I.c Asp Fhe :-y8 Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
55 130 135 140 
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AAC TAG AAC AGC CAC AAC GTC TAT ATC ATG GCC GAG AAG CAG AAG AAC 4 80 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

5 GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly lie Lys Val Aan Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 

165 170 175 

GTG CAG CTC GCC GAC CAC TAG CAG CAG AAC ACC CCC ATC GGC GAC GGC 5 76 

10 Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 

160 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAG CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
15 195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

20 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAG AAG TCC 72 0 

Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

25 GGA CTC AGA TCT ACC ATG GCG GGC TGG ATC CAG GCC CAG CAG CTG CAG 76 8 

Gly Leu Arg Ser Thr Met Ala Gly Trp lie Gin Ala Gin Gin Leu Gin 

245 250 255 

GGA GAC GCG CTG CGC CAG ATG CAG GTG CTG TAG GGC CAG CAC TTC CCC 816 
30 Gly Asp Ala Leu Arg Gin Met Gin Val Leu Tyr Gly Gin His Phe Pro 

260 265 270 

ATC GAG GTC CGG CAC TAG TTG GCC CAG TGG ATT GAG AGC CAG CCA TGG 864 
lie Glu Val Arg His Tyr Leu Ala Gin Trp lie Glu Ser Gin Pro Trp 
35 275 280 285 

GAT GCC ATT GAC TTG GAC AAT CCC CAG GAC AGA GCC CAA GCC ACC CAG 912 

Asp Ala lie Asp Leu Asp Asn Pro Gin Asp Arg Ala Gin Ala Thr Gin 
290 295 300 

40 

CTC CTG GAG GGC CTG GTG CAG GAG CTG CAG AAG AAG GCG GAG CAC CAG 960 

Leu Leu Glu Gly Leu Val Gin Glu Leu Gin Lys Lys Ala Glu His Gin 

305 310 315 320 

45 GTG GGG GAA GAT GGG TTT TTA CTG AAG ATC AAG CTG GGG CAC TAC GCC 100 8 

Val Gly Glu Asp Gly Phe Leu Leu Lys He Lys Leu Gly His Tyr Ala 

325 330 335 



■ " ...... . , » , , , , , T . , ^Tr^ ^ ^ -n ^. ^ ^ /— I i> • ^ ^ - , 

■ ■ ■ • t\.i\^ ■_ i O 1 V- ^^M. ^ ^ ^ J 't 

Cys lie Arg HiG He Leu Tyr Asn Glu Gl:i Arg Leu Val Arg Glu Ala 

55 355 36C 365 
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AAC AAT TGC AGC TCT CCG OCT GGG ATC CTG GTT GAC GCC ATG TCC CAG 115 2 
Asn Asn Cys Ser Ser Pro Ala Gly lie Leu Val Asp Ala Met Ser Gin 
370 375 380 

5 AAG CAC CTT CAG ATC AAC CAG AC A TTT GAG GAG CTG CGA CTG GTC ACG 12 0 0 
Lys His Leu Gin lie Asn Gin Thr Phe Glu Glu Leu Arg Leu Val Thr 
385 390 395 400 

CAG GAC ACA GAG AAT GAG CTG AAG AAA CTG CAG CAG ACT CAG GAG TAC 124 8 

10 Gin Asp Thr Glu Asn Glu Leu Lys Lys Leu Gin Gin Thr Gin Glu Tyr 

405 410 415 

TTC ATC ATC CAG TAC CAG GAG AGC CTG AGG ATC CAA GCT CAG TTT GCC 12 9b 

Phe lie He Gin Tyr Gin Glu Ser i-eu Arg He Gin Ala Gin Phe Ala 
15 420 425 430 

CAG CTG GCC CAG CTG AGC CCC CAG GAG CGT CTG AGC CGG GAG ACG GCC 13 44 
Gin Leu Ala Gin Leu Ser Pro Gin Glu Arg Leu Ser Arg Glu Thr Ala 
435 440 445 

20 

CTC CAG CAG AAG CAG GTG TCT CTG GAG GCC TGG TTG CAG CGT GAG GCA 13 92 
Leu Gin Gin Lys Gin Val Ser Leu Glu Ala Trp Leu Gin Arg Glu Ala 
450 455 460 

25 CAG ACA CTG CAG CAG TAC CGC GTG GAG CTG GCC GAG AAG CAC CAG AAG 144 0 

Gin Thr Leu Gin Gin Tyr Arg Val Glu Leu Ala Glu Lys His Gin Lys 
465 470 475 430 

ACC CTG CAG CTG CTG CGG AAG CAG CAG ACC ATC ATC CTG GAT GAC GAG 14 8 8 
30 Thr Leu Gin Leu Leu Arg Lys Gin Gin Thr He lie Leu Asp Asp Glu 

485 490 495 

CTG ATC CAG TGG AAG CGG CGG CAG CAG CTG GCC GGG AAC GGC GGG CCC 15 3 6 
Leu He Gin Trp Lys Arg Arg Gin Gin Leu Ala Gly Asn Gly Gly Pro 
35 500 505 510 

CCC GAG GGC AGC CTG GAC GTG CTA CAG TCC TGG TGT GAG AAG TTG GCC 15 84 

Pro Glu Gly Ser Leu Asp Val Leu Gin Ser Trp Cys Glu Lys Leu Ala 
515 520 525 

40 

GAG ATC ATC TGG CAG AAC CGG CAG CAG ATC CGC AGG GCT GAG CAC CTC 163 2 

Glu He He Trp Gin Asn Arg Gin Gin He Arg Arg Ala Glu His Leu 
530 535 540 

45 TGC CAG CAG CTG CCC ATC CCC GGC CCA GTG GAG GAG ATG CTG GCC GAG 168 0 

Cys Gin Gin Leu Pro He Pro Gly Pro Val Glu Glu Met Leu Ala Glu 
545 550 555 560 



GTC AAC GCC 



^ e 



lie 



55 



580 



._ ^ ^ v^" ^ ^./\^_> J-\\Z^ CAO y-\C^ J-^Mw 

I^ro Pro Gin Val Leu I.ys Thr Gin Thr Lye 
585 590 



1 ^' 6 
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TTT GCA GCC ACC GTA CGC CTG CTG GTG GGC GGG AAG CTG AAC GTG CAC 1824 
Phe Ala Ala Thr Val Arg Leu Leu Val Gly Gly Lys Leu Asn Val His 
595 600 605 

5 ATG AAT CCC CCC CAG GTG AAG GCC ACC ATC ATC AGT GAG CAG CAG GCC 1872 
Met Asn Pro Pro Gin Val Lys Ala Thr lie lie Ser Glu Gin Gin Ala 
610 615 620 

AAG TCT CTG CTT AAA AAT GAG AAC ACC CGC AAC GAG TGC AGT GGT GAG 192 0 

10 Lys Ser Leu Leu Lys Asn Glu Asn Thr Arg Asn Glu Cys Ser Gly Glu 
625 630 635 640 



n <-\ ^ 2 

^ -7 O O 



ATC CTG AAC AAC TGC TGC GTG ATG GAG TAC CAC CAA GCC ACG GGC ACC 
lie Leu Asn Asn CyR Cys Val MeL Glu Tyr His Gin Ala Thr Gly Thr 
15 645 650 655 

CTC AGT GCC CAC TTC AGG AAC ATG TCA CTG AAG AGG ATC AAG CGT GCT 2 016 
Leu Ser Ala His Phe Arg Asn Met Ser Leu Lys Arg lie Lys Arg Ala 

660 665 670 

20 

GAC CGG CGG GGT GCA GAG TCC GTG AC A GAG GAG AAG TTC ACA GTC CTG 2 064 
Asp Arg Arg Gly Ala Glu Ser Val Thr Glu Glu Lys Phe Thr Val Leu 
675 6B0 685 

25 TTT GAG TCT CAG TTC AGT GTT GGC AGC AAT GAG CTT GTG TTC CAG GTG 2112 
Phe Glu Ser Gin Phe Ser Val Gly Ser Asn Glu Leu Val Phe Gin Val 
690 695 700 

AAG ACT CTG TCC CTA CCT GTG GTT GTC ATC GTC CAC GGC AGC CAG GAC 2160 
30 Lys Thr Leu Ser Leu Pro Val Val Val He Val His Gly Ser Gin Asp 
705 710 715 720 

CAC AAT GCC ACG GCT ACT GTG CTG TGG GAC AAT GCC TTT GCT GAG CCG 2208 
His Asn Ala Thr Ala Thr Val Leu Trp Asp Asn Ala Phe Ala Glu Pro 
35 725 730 735 

GGC AGG GTG CCA TTT GCC GTG CCT GAC AAA GTG CTG TGG CCG CAG CTG 225 6 

Gly Arg Val Pro Phe Ala Val Pro Asp Lys Val Leu Trp Pro Gin Leu 

740 745 750 

40 

TGT GAG GCG CTC AAC ATG AAA TTC AAG GCC GAA GTG CAG AGC AAC CGG 2 3 04 

Cys Glu Ala Leu Asn Met Lys Phe Lys Ala Glu Val Gin Ser Asn Arg 

755 760 765 

45 GGC CTG ACC AAG GAG AAC CTC GTG TTC CTG GCG CAG AAA CTG TTC AAC 2 3 52 

Gly Leu Thr Lys Glu Asn Leu Val Phe Leu Ala Gin Lys Leu Phe Asn 
770 775 780 

AAC AGC AGC A^^ ^^-^ '^"^ "^'^ " ' 



ACC TTC 



•■* ^ 



55 



Pr^e Asn Arg Glu Asn 

ec5 



.GU Pro Gly Trp Asn Tyr Thr Phe 
810 815 
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CAG TGG TTT 
Gin Trp Phe 

5 CAC TGG AAT 
His Trp Asn 
835 

CAC GAC CTG 
10 His Asp Leu 
850 

AGT GAC TCA 
Ser Asp Ser 
15 865 

CCG GAA CGC 
Pro Glu Arg 

20 

TCC ATC AGG 
Ser lie Arg 

25 TAT GTG TTT 
Tyr Val Phe 
915 

ACT OCT GTG 
30 Thr Pro Val 
930 

AAG CAA GTG 
Lys Gin Val 
35 945 

AGC AGC GCC 
Ser Ser Ala 

40 

CAG GCT CCC 
Gin Ala Pro 

45 CAG GAT GGA 
Gin Asp Gly 
995 

GTG GAG GAA 



GAC GGG GTG ATG GAG 
Asp Gly Val Met Glu 
820 

GAT GGG GCC ATC CTA 
Asp Gly Ala lie Leu 

840 

CTC ATC AAC AAG CCC 
Leu lie Asn Lys Pro 

855 

GAA ATC GGG GGC ATC 
Glu lie m.y Gly He 
870 

AAC CTG TGG AAC CTG 
Asn Leu Trp Asn Leu 
885 

TCC CTG GCT GAC CGG 
Ser Leu Ala Asp Arg 
900 

CCT GAC CGC CCC AAG 
Pro Asp Arg Pro Lys 

920 

CTG GCT AAA GCT GTT 
Leu Ala Lys Ala Val 

935 

GTC CCT GAG TTT GTG 
Val Pro Glu Phe Val 
950 

ACG TAC ATG GAC CAG 
Thr Tyr Met Asp Gin 
965 

TAT AAC ATG TAC CCA 
Tyr Asn Met Tyr Pro 
980 

GAA TTC GAC CTG GAT 
Glu Phe Asp Leu Asp 

1000 
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GTG TTG AAG AAG CAC 
Val Leu Lys Lys His 
825 

GGT TTT GTG AAT AAG 
Gly Phe Val Asn Lys 

845 

GAC GGG ACC TTC TTG 
Asp Gly Thr Phe Leu 

860 

ACC ATC GCC TGG AAG 
Thr He Ala Trp Lys 
875 

AAA CCA TTC ACC ACG 
Lys Pro Phe Thr Thr 
890 

CTG GGG GAC CTG AGC 
Leu Gly Asp Leu Ser 
905 

GAT GAG GTC TTC TCC 
Asp Glu Val Phe Ser 

925 

GAT GGA TAT GTG AAA 
Asp Gly Tyr Val Lys 

940 

AAT GCA TCT GCA GAT 
Asn Ala Ser Ala Asp 
955 

GCC CCC TCC CCA GCT 
Ala Pro Ser Pro Ala 
970 

CAG AAC CCT GAC CAT 
Gin Asn Pro Asp His 
985 

GAG ACC ATG GAT GTG 
Glu Thr Met Asp Val 

1005 



CAC AAG CCC 24 96 

His Lys Pro 

830 

CAA CAG GCC 2544 
Gin Gin Ala 



TTG CGC TTT 2 5 92 
Leu Arg Phe 

TTT GAC TCC 2 64 0 

Phe Asp Ser 
880 

CGG GAT TTC 26 8 8 

Arg Asp Phe 
895 

TAT CTC ATC 2736 

Tyr Leu He 

910 

AAG TAC TAC 27 84 

Lys Tyr Tyr 



CCA CAG ATC 2832 
Pro Gin He 



GCT GGG GGC 2 88 0 
Ala Gly Gly 
960 

GTG TGC CCC 2 92 8 

Val Cys Pro 
975 

GTA CTC GAT 2 97 6 

Val Leu Asp 

990 

GCC AGG CAC 3 02 4 

Ala Arg His 



55 



r r o Ala c; 1. v 



102 5 



Le'^ Phe Thr Ser Ala Arg Gly Ser Leu Ser 
0 3 0 10 3 5 1 



^ ^ z 
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(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1039 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

15 Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
20 35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

25 Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 lOS 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
30 115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 ISO 155 160 

35 Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 1B5 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
40 195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

45 Gly Leu Arg Ser Thr Met Ala Gly Trp He Gin Ala Gin Gin Leu Gin 

245 250 255 

Gly Asp Ala Leu Arg Gin Met Gin Val Leu Tyr Gly Gin His Phe Pro 

260 265 270 

He Glu Val Ara ' 



......... ._: .1 ; . ^'/..j j^/'G A i el iiio Gin 

jO'^ 310 315 32C 

55 Val Gly Glu Asp Gly Phe Leu Leu Lys He Lys Leu Gly His Tyr Ala 

325 3 3^ 
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Thr Gin Leu Gin Lys Thr Tyr Asp Arg Cys Pro Leu Glu Leu Val Arg 

340 345 350 

Cys lie Arg His lie Leu Tyr Asn Glu Gin Arg Leu Val Arg Glu Ala 
355 360 365 

5 Asn Asn Cys Ser Ser Pro Ala Gly lie Leu Val Asp Ala Met Ser Gin 
370 375 380 

Lys His Leu Gin lie Asn Gin Thr Phe Glu Glu Leu Arg Leu Val Thr 
385 390 395 400 

Gin Asp Thr Glu Asn Glu Leu Lys Lys Leu Gin Gin Thr Gin Glu Tyr 
10 405 410 415 

Phe He He Gin Tyr Gin Glu Ser Leu Arg He Gin Ala Gin Phe Ala 

420 425 430 

Gin Leu Ala Gin Leu Ser Pro Gin Glu Arg Leu Ser Arg Glu Thr Ala 
435 44U 445 

15 Leu Gin Gin Lys Gin Val Ser Leu Glu Ala Trp Leu Gin Arg Glu Ala 
450 455 460 

Gin Thr Leu Gin Gin Tyr Arg Val Glu Leu Ala Glu Lys His Gin Lys 
465 470 475 480 

Thr Leu Gin Leu Leu Arg Lys Gin Gin Thr He He Leu Asp Asp Glu 
20 485 490 495 

Leu He Gin Trp Lys Arg Arg Gin Gin Leu Ala Gly Asn Gly Gly Pro 

500 505 510 

Pro Glu Gly Ser Leu Asp Val Leu Gin Ser Trp Cys Glu Lys Leu Ala 
515 520 525 

25 Glu He He Trp Gin Asn Arg Gin Gin He Arg Arg Ala Glu His Leu 
530 535 540 

Cys Gin Gin Leu Pro He Pro Gly Pro Val Glu Glu Met Leu Ala Glu 
545 550 555 560 

Val Asn Ala Thr He Thr Asp He He Ser Ala Leu Val Thr Ser Thr 
30 565 570 575 

Phe He He Glu Lys Gin Pro Pro Gin Val Leu Lys Thr Gin Thr Lys 

580 585 590 

Phe Ala Ala Thr Val Arg Leu Leu Val Gly Gly Lys Leu Asn Val His 
595 600 605 

35 Met Asn Pro Pro Gin Val Lys Ala Thr He He Ser Glu Gin Gin Ala 
610 615 620 

Lys Ser Leu Leu Lys Asn Glu Asn Thr Arg Asn Glu Cys Ser Gly Glu 
625 630 635 640 

He Leu Asn Asn Cys Cys Val Met Glu Tyr His Gin Ala Thr Gly Thr 
40 645 650 655 

Leu Ser Ala His Phe Arg Asn Met Ser Leu Lys Arg He Lys Arg Ala 

660 665 670 

Asp Arg Arg Gly Ala Glu Ser Val Thr Glu Glu Lys Phe Thr Val Leu 
675 680 685 

45 Phe Glu Ser Gin Phe Ser Val Gly Ser Asn Glu Leu Val Phe Gin Val 
690 695 700 

Lys Thr Leu Ser Leu Pro Val Val Val He Val His Gly Ser Gin Asp 
705 710 715 



sn A r a 



755 



' ■ ■■■ l-'.'j:.. i:;e L^yy, A. a ^^u \'al Gin Ser A 

760 765 

55 Gly Leu Thr Lys Glu Asn Leu Val Phe Leu Ala Gin Lys Leu Phe Asn 
770 7-7^ 
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Asn Ser Ser Ser His Leu Glu Asp Tyr Ser Gly Leu Ser Val Ser Trp 
785 790 795 800 

Ser Gin Phe Asn Arg Glu Asn Leu Pro Gly Trp Asn Tyr Thr Phe Trp 

605 810 815 

5 Gin Trp Phe Asp Gly Val Met Glu Val Leu Lys Lys His His Lys Pro 

820 825 830 

His Trp Asn Asp Gly Ala lie Leu Gly Phe Val Asn Lys Gin Gin Ala 

835 840 845 

His Asp Leu Leu lie Asn Lys Pro Asp Gly Thr Phe Leu Leu Arg Phe 
10 850 855 860 

Ser Asp Ser Glu He Gly Gly He Thr lie Ala Trp Lys Phe Asp Ser 
865 870 875 880 

Pro Glu Arg Asn Leu Trp Asn Leu Lys Pro Phe Thr Thr Arg Asp Phe 

885 390 895 

15 Ser He Arg Ser Leu Ala Asp Arg Leu Gly Asp Leu Ser Tyr Leu He 

900 905 910 

Tyr Val Phe Pro Asp Arg Pro Lys Asp Glu Val Phe Ser Lys Tyr Tyr 

915 920 925 

Thr Pro Val Leu Ala Lys Ala Val Asp Gly Tyr Val Lys Pro Gin He 
20 930 935 940 

Lys Gin Val Val Pro Glu Phe Val Asn Ala Ser Ala Asp Ala Gly Gly 
945 950 955 960 

Ser Ser Ala Thr Tyr Met Asp Gin Ala Pro Ser Pro Ala Val Cys Pro 

965 970 975 

25 Gin Ala Pro Tyr Asn Met Tyr Pro Gin Asn Pro Asp His Val Leu Asp 

980 985 990 

Gin Asp Gly Glu Phe Asp Leu Asp Glu Thr Met Asp Val Ala Arg His 

995 1000 1005 

Val Glu Glu Leu Leu Arg Arg Pro Met Asp Ser Leu Asp Ser Arg Leu 
30 1010 1015 1020 

Ser Pro Pro Ala Gly Leu Phe Thr Ser Ala Arg Gly Ser Leu Ser 
025 1030 1035 1 



35 



r 



(2) INFORMATION FOR SEQ ID NO: 56 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1875 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

45 (A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1 . . . 1872 
(D) OTHER INFORMATION: 



(xi) SEQUENCE r^F.f^r--^ -r---- 



15 



55 GGA ACT GCT GGG GTC GTC CGG GTG GTC CCC GGG GAG GTG GAG GTG GTG 96 

Gly Thr Ala Gly Val Val Pro Val Val Pro Glv Glu ^ ■ " ~/ 
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20 25 30 

AAG GGG GAG CCA TTC GAT GTG GGC CCA CGC TAC ACG CAG CTG CAG TAC 14 4 

Lys Gly Gin Pro Phe Asp Val Gly Pro Arg Tyr Thr Gin Leu Gin Tyr 
35 40 45 

ATC GGC GAG GGC GCG TAC GGC ATG GTC AGC TCA GCT TAT GAC CAC GTG 192 
lie Gly Glu Gly Ala Tyr Gly Met Val Ser Ser Ala Tyr Asp His Val 
50 55 60 

CGC AAG ACC AGA GTG GCC ATC AAG AAG ATC AGC CCC TTT GAG CAT CAA 24 0 

Arg Lys Thr Arg Val Ala lie Lys Lys lie Ser Pro Phe Glu His Gin 
65 70 75 SO 

15 ACC TAC TGT CAG CGC ACG CTG AGG GAG ATC CAG ATC TTG CTG CGA TTC 28 8 

Thr Tyr Cys Gin Arg Thr Leu Arg Glu lie Gin lie Leu Leu Arg Phe 

85 90 95 

CGC CAT GAG AAT GTT ATA GGC ATC CGA GAC ATC CTC AGA GCG CCC ACC 3 36 

20 Arg His Glu Asn Val lie Gly He Arg Asp He Leu Arg Ala Pro Thr 

100 105 110 

CTG GAA GCC ATG AGA GAT GTT TAC ATT GTT CAG GAC CTC ATG GAG ACA 3 84 

Leu Glu Ala Met Arg Asp Val Tyr He Val Gin Asp Leu Met Glu Thr 
25 115 120 125 

GAC CTG TAC AAG CTG CTT AAA AGC CAG CAG CTG AGC AAT GAC CAC ATC 432 

Asp Leu Tyr Lys Leu Leu Lys Ser Gin Gin Leu Ser Asn Asp His He 

130 135 140 

30 

TGC TAC TTC CTC TAC CAG ATC CTC CGG GGC CTC AAG TAT ATA CAC TCA 480 

Cys Tyr Phe Leu Tyr Gin He Leu Arg Gly Leu Lys Tyr He His Ser 

145 150 155 160 

35 GCC AAT GTG CTG CAC CGG GAC CTG AAG CCT TCC AAT CTG CTT ATC AAC 52 8 

Ala Asn Val Leu His Arg Asp Leu Lys Pro Ser Asn Leu Leu He Asn 

165 170 175 

ACC ACC TGC GAC CTT AAG ATC TGT GAT TTT GGC CTG GCC CGG ATT GCT 5 76 

40 Thr Thr Cys Asp Leu Lys He Cys Asp Phe Gly Leu Ala Arg He Ala 

180 185 190 

GAC CCT GAG CAC GAC CAC ACT GGC TTT CTG ACG GAG TAT GTG GCC ACA 624 
Asp Pro Glu His Asp His Thr Gly Phe Leu Thr Glu Tyr Val Ala Thr 
45 195 200 205 

CGC TGG TAC CGA GCC CCA GAG ATC ATG CTT AAT TCC AAG GGC TAC ACC 67 2 

Arg Trp Tyr Arg Ala Pro Glu He Met Leu Asn Se- "v- r.- - 
91 n 



24 0 



55 TCC AAC CGG CCC ATC TTC CCC GGC AAG CAC TAC CTG GAC CAG CTC AAC 

Ser Asn Arg Pro He Phe Pro n^v T/.-n ^'-i^ "T-.-r- : ' ■' ■ 
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245 250 255 

CAC ATT CTA GGT ATC TTG GOT TCC CCA TCC CAG GAG GAC CTT AAT TGC 816 

His lie Leu Gly lie Leu Gly Ser Pro Ser Gin Glu Asp Leu Asn Cys 

260 265 270 

ATC ATT AAC ATG AAG GCC CGA AAC TAG CTG CAG TCT CTG CCC TCG AAA 8 64 

lie lie Asn Met Lys Ala Arg Asn Tyr Leu Gin Ser Leu Pro Ser Lys 
275 280 285 

ACC AAG GTG GCT TGG GCC AAG CTC TTT CCT AAA TCT GAC TCC AAA GCT 912 

Thr Lys Val Ala Trp Ala Lys Leu Phe Pro Lys Ser Asp Ser Lys Ala 
290 295 300 



io CTT GAC CTG CTG GAC CGG ATG TTA ACC TTC AAC CCA AAC AAG CGC ATC 96 0 

Leu Asp Leu Leu Asp Arg Met Leu Thr Phe Asn Pro Asn Lys Arg lie 
305 310 315 320 

ACA GTA GAG GAA GCG CTG GCT CAC CCT TAC CTG GAA CAG TAC TAC GAT 100 8 

20 Thr Val Glu Glu Ala Leu Ala His Pro Tyr Leu Glu Gin Tyr Tyr Asp 

325 330 335 

CCG ACA GAT GAG CCA GTG GCC GAG GAG CCA TTC ACC TTC GAC ATG GAG 105 6 

Pro Thr Asp Glu Pro Val Ala Glu Glu Pro Phe Thr Phe Asp Met Glu 
25 340 345 350 

CTG GAT GAC CTC CCC AAG GAG CGG CTG AAG GAG TTG ATC TTC CAG GAG 1104 

Leu Asp Asp Leu Pro Lys Glu Arg Leu Lys Glu Leu lie Phe Gin Glu 

355 360 365 

30 

ACA GCC CGC TTC CAG CCA GGG GCG CCA GAG GGC CCC GGG CGC GCC ATG 115 2 

Thr Ala Arg Phe Gin Pro Gly Ala Pro Glu Gly Pro Gly Arg Ala Met 
370 375 380 

35 AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA ATT CTT GTT GAA 12 0 0 
Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu 
385 390 395 400 

TTA GAT GGC GAT GTT AAT GGG CAA AAA TTC TCT GTT AGT GGA GAG GGT 12 4 8 
40 Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val Ser Gly Glu Gly 

405 410 415 

GAA GGT GAT GCA ACA TAC GGA AAA CTT ACC CTT AAA TTT ATT TGC ACT 12 9 6 

Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr 
45 420 425 430 

ACT GGG AAG CTA CCT GTT CCA TGG CCA ACG CTT GTC ACT ACT CTC ACT 13 44 

Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Th-^ ^ ^ 

4 3 5 



4 60 



GAC TTT TTC T-JkG AGT GCC ATG CCC GAA GGT TAT GTA CAG GAA AGA ACT 144 0 

Asp Phe Phe Lys Eer Ala Met Pro Glu Glv Tvr ^- ' 
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465 470 475 4B0 

ATA TTT TAC AAA GAT GAC GGG AAC TAG AAG ACA CGT GCT GAA GTC AAG 14 8 8 
lie Phe Tyr Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys 

485 490 495 

TTT GAA GGT GAT ACC CTT GTT AAT AGA ATC GAG TTA AAA GGT ATT GAT 153 6 
Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie Asp 

500 505 510 

TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA ATG GAA TAC AAT TAT 1584 
Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Met Glu Tyr Asn Tyr 
515 520 525 

15 AAC TCA CAT AAT GTA TAC ATC ATG GCA GAC AAA CCA AAG AAT GGC ATC 163 2 
Asn Ser His Asn Val Tyr He Met Ala Asp Lys Pro Lys Asn Gly lie 
530 535 540 

AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT AAA GAT GGA AGC GTT CAA 16 8 0 

20 Lys Val Asn Phe Lys He Arg His Asn He Lys Asp Gly Ser Val Gin 
545 550 555 560 

TTA GCA GAC CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT GGC CCT GTC 172 8 
Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val 
25 565 570 575 

CTT TTA CCA GAC AAC CAT TAC CTG TCC ACG CAA TCT GCC CTT TCC AAA 1776 

Leu Leu Pro Asp Asn Kis Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys 

580 585 590 

30 

GAT CCC AAC GAA AAG AGA GAT CAC ATG ATC CTT CTT GAG TTT GTA ACA 1824 

Asp Pro Asn Glu Lys Arg Asp His Met He Leu Leu Glu Phe Val Thr 

595 600 605 

35 GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC AAA CCT CAG GAG T 1673 
Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys Pro Gin Glu 
610 615 620 



40 



AA 

(2) INFORMATION FOR SEQ ID NO: 57 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 624 amino acids 
45 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



X.eL Al.d. Ala Ala Ala Ala Ala Pre Gly Gly Gly Gly Gly Glu Pro Arg 
55 1 5 10 15 

ly Thr Ala Gly Val Val Fro Val V--^. r^-- " ^ " 



1875 
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20 25 30 

Lys Gly Gin Pro Phe Asp Val Gly Pro Arg Tyr Thr Gin Leu Gin Tyr 

35 40 45 

lie Gly Glu Gly Ala Tyr Gly Met Val Ser Ser Ala Tyr Asp His Val 
5 50 55 60 

Arg Lys Thr Arg Val Ala lie Lys Lys He Ser Pro Phe Glu His Gin 
65 70 75 80 

Thr Tyr Cys Gin Arg Thr Leu Arg Glu He Gin He Leu Leu Arg Phe 

85 90 95 

10 Arg His Glu Asn Val He Gly He Arg Asp He Leu Arg Ala Pro Thr 

100 105 110 

Leu Glu Ala Met Arg Asp Val Tyr He Val Gin Asp Leu Met Glu Thr 

115 120 125 

Asp Leu Tyr Lys Leu Leu Lys Ser Gin Gin ijeu Ser Asn Asp His He 
15 130 135 140 

Cys Tyr Phe Leu Tyr Gin He Leu Arg Gly Leu Lys Tyr He His Ser 
145 150 155 160 

Ala Asn Val Leu His Arg Asp Leu Lys Pro Ser Asn Leu Leu He Asn 

165 170 175 

20 Thr Thr Cys Asp Leu Lys He Cys Asp Phe Gly Leu Ala Arg He Ala 

180 185 190 

Asp Pro Glu His Asp His Thr Gly Phe Leu Thr Glu Tyr Val Ala Thr 

195 200 205 

Arg Trp Tyr Arg Ala Pro Glu He Met Leu Asn Ser Lys Gly Tyr Thr 
25 210 215 220 

Lys Ser He Asp He Trp Ser Val Gly Cys He Leu Ala Glu Met Leu 
225 230 235 240 

Ser Asn Arg Pro He Phe Pro Gly Lys His Tyr Leu Asp Gin Leu Asn 

245 250 255 

30 His He Leu Gly He Leu Gly Ser Pro Ser Gin Glu Asp Leu Asn Cys 

260 265 270 

He He Asn Met Lys Ala Arg Asn Tyr Leu Gin Ser Leu Pro Ser Lys 

275 280 285 

Thr Lys Val Ala Trp Ala Lys Leu Phe Pro Lys Ser Asp Ser Lys Ala 
35 290 295 300 

Leu Asp Leu Leu Asp Arg Met Leu Thr Phe Asn Pro Asn Lys Arg He 
305 310 315 320 

Thr Val Glu Glu Ala Leu Ala His Pro Tyr Leu Glu Gin Tyr Tyr Asp 

325 330 335 

40 Pro Thr Asp Glu Pro Val Ala Glu Glu Pro Phe Thr Phe Asp Met Glu 

340 345 350 

Leu Asp Asp Leu Pro Lys Glu Arg Leu Lys Glu Leu He Phe Gin Glu 

355 360 365 

Thr Ala Arg Phe Gin Pro Gly Ala Pro Glu Gly Pro Gly Arg Ala Met 
45 370 375 380 

Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu 
385 390 395 400 

Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val Ser Glv Gli: 

4 0"^ 



■t -T ■. . ^ D 

/ ' r\ \ Cys Phe Ser Arg Tyr Pro Asp His Met Lys Glr. His 

55 450 455 460 

Asp Phe Phe Lys Ser Ala Ket Pro G;u GVv Tvr ' ■ ■ - ^ - ■ ^ ■ 
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465 470 475 480 

lie Phe Tyr Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys 

485 490 495 

Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie Asp 
5 500 505 510 

Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Met Glu Tyr Asn Tyr 

515 520 525 

Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Pro Lys Asn Gly lie 
530 535 540 

10 Lys Val Asn Phe Lys lie Arg His Asn He Lys Asp Gly Ser Val Gin 
545 550 555 560 

Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val 

565 570 575 

Leu Leu Pro Asp Asn His Tyr Lpi_i Ser Thr Gla Ser Ala Leu Ser Lys 

580 585 590 

Asp Pro Asn Glu Lys Arg Asp His Met He Leu Leu Glu Phe Val Thr 

595 600 605 

Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys Pro Gin Glu 
610 615 620 



1 c 



20 



35 



45 



(2) INFORMATION FOR SEQ ID N0:5B: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1815 base pairs 

25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLEOHjE TYPE: CDNA 
30 (ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1 . . . 1811 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 



ATG GCG GCG GCG GCG GCG GCG GGC CCG GAG ATG GTC CGC GGG CAG GTG 4 8 

Met Ala Ala Ala Ala Ala Ala Gly Pro Glu Met Val Arg Gly Gin Val 

40 1 5 10 15 

TTC GAC GTG GGG CCG CGC TAC ACT AAT CTC TCG TAC ATC GGA GAA GGC 96 

Phe Asp Val Gly Pro Arg Tyr Thr Asn Leu Ser Tyr He Gly Glu Gly 

20 25 30 



GCC TAC GGC ATG GTT TGT TCT GCT TAT GAT AAT CTC AAC AAA GTT CGA 14 4 

Ala Tyr Gly Met Val Cys Ser Ala Tyr Asp Asn Leu Asn Lys Val Arg 
35 40 45 



.'1.VJ>^ .,i'^J v^M.Lj M i /-V M./-\M. rti^ ;^ijA v_lVJ ii.^^ Mo A ^f\x rtJ-i.^ Z, ^ 

55 Arg Thr Leu Arg Glu He Lys He Leu Leu Arg Phe Arg His Glu Asn 

6 5 7 0 7 5 - " 
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ATC ATC GGC ATC AAT GAC ATC ATC CGG GCA CCA ACC ATT GAG CAG ATG 28 8 

lie lie Gly lie Asn Asp lie lie Arg Ala Pro Thr lie Glu Gin Met 

85 90 95 

5 

AAA GAT GTA TAT ATA GTA CAG GAC CTC ATG GAG ACA GAT CTT TAC AAG 33 6 

Lys Asp Val Tyr lie Val Gin Asp Leu Met Glu Thr Asp Leu Tyr Lys 

100 105 110 

10 CTC TTG AAG ACA CAG CAC CTC AGC AAT GAT CAT ATC TGC TAT TTT CTT 3 84 

Leu Leu Lys Thr Gin His Leu Ser Asn Asp His lie Cys Tyr Phe Leu 
115 120 125 

TAT CAG ATC CTG AG A QGA TTA AAG TAT ATA CAT TCA GCT AAT GTT CTG 43 2 

15 Tyr Gin lie Leu Arg Gly Leu Lys Tyr He His Ser Ala Asn Val Leu 
130 135 140 

CAC CGT GAC CTC AAG CCT TCC AAC CTC CTG CTG AAC ACC ACT TGT GAT 4 80 

His Arg Asp Leu Lys Pro Ser Asn Leu Leu Leu Asn Thr Thr Cys Asp 
20 145 150 155 160 

CTC AAG ATC TGT GAC TTT GGC CTT GCC CGT GTT GCA GAT CCA GAC CAT 52 8 

Leu Lys He Cys Asp Phe Gly Leu Ala Arg Val Ala Asp Pro Asp His 

165 170 175 

25 

GAT CAT ACA GGG TTC TTG ACA GAG TAT GTA GCC ACG CGT TGG TAC AGA 576 
Asp His Thr Gly Phe Leu Thr Glu Tyr Val Ala Thr Arg Trp Tyr Arg 

180 185 190 

30 GCT CCA GAA ATT ATG TTG AAT TCC AAG GGT TAT ACC AAG TCC ATT GAT 624 
Ala Pro Glu He Met Leu Asn Ser Lys Gly Tyr Thr Lys Ser He Asp 
195 200 205 

ATT TGG TCT GTG GGC TGC ATC CTG GCA GAG ATG CTA TCC AAC AGG CCT 672 
35 He Trp Ser Val Gly Cys He Leu Ala Glu Met Leu Ser Asn Arg Pro 
210 215 220 

ATC TTC CCA GGA AAG CAT TAC CTT GAC CAG CTG AAT CAC ATC CTG GGT 72 0 

He Phe Pro Gly Lys His Tyr Leu Asp Gin Leu Asn His He Leu Gly 
40 225 230 235 240 

ATT CTT GGA TCT CCA TCA CAG GAA GAT CTG AAT TGT ATA ATA AAT TTA 768 
He Leu Gly Ser Pro Ser Gin Glu Asp Leu Asn Cys He He Asn Leu 

245 250 255 



45 



AAA GCT AGA AAC TAT TTG CTT TCT CTC CCG CAC AAA AAT AAG GTG CCG 816 
Lys Ala Arg Asn Tyr Leu Leu Ser Leu Pro His Lys Asn Lys Val Pro 

260 265 270 



GAT AAA ATG TTG ACA TTT AAC CCT CAC hJ\G AGG ATT GAA GTT GAA CAG 9 12 

55 Asp Lys Met Leu Thr Phe Asn Pro His Lys Arg He Glu Val Glu Gin 
2 9 0 2 9 5 " ■'^ 
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GCT CTG GCC CAC CCG TAC CTG GAG CAG TAT TAT GAG CCA AGT GAT GAG 960 

Ala Leu Ala His Pro Tyr Leu Glu Gin Tyr Tyr Asp Pro Ser Asp Glu 

305 310 315 320 

5 

CCC ATT GCT GAA GCA CCA TTC AAG TTT GAC ATG GAG CTG GAC GAC TTA 10 0 8 

Pro lie Ala Glu Ala Pro Phe Lys Phe Asp Met Glu Leu Asp Asp Leu 

325 330 335 

10 CCT AAG GAG AAG CTC AAA GAA CTC ATT TTT GAA GAG ACT GCT CGA TTC 105 6 

Pro Lys Glu Lys Leu Lys Glu Leu He Phe Glu Glu Thr Ala Arg Phe 

340 345 350 

CAG CCA GGA TAC AGA TCT ATG GAT CCA CCG GTC GCC ACC ATG GTG AGC 1104 
15 Gin Pro Gly Tyr Arg Ser Met Asp Pro Pro Val Ala Thr Met Val Ser 

355 360 365 

AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC GAG CTG 1152 
Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu 
20 370 375 380 

GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG GGC GAG 12 0 0 

Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu 

385 390 395 400 

25 

GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC ACC ACC 124 8 

Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr 

405 410 415 

30 GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG ACC TAC 12 96 

Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr 

420 425 430 

GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG CAG CAC GAC 13 4 4 

35 Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp 

435 440 445 

TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC 13 92 

Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He 
40 450 455 460 

TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG AAG TTC 144 0 

Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe 
465 470 475 480 



45 



GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC GAC TTC 14 8 8 

Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp Phe 

485 490 '^^"^ 



A:;c: CJAC A>\C GTC tat ATC ATG GCC GAC AAG CAG AAG AAC GGC ATC AAG 15 84 

55 Ser His Asn Val Tyr He Met: Ala Asp Lys Gin Lys Asn Gly He Lys 

515 52 C 



SUBSTITUTE SHEET (RULE 25) 



wo 98/45704 



PCT/DK98/00145 

125 



GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG CAG CTC 163 2 

Val Asn Phe Lys lie Arg His Asn He Glu Asp Gly Ser Val Gin Leu 
530 535 540 

5 

GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC GTG CTG 168 0 

Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu 
545 550 555 560 

10 CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG AGC AAA GAC 172 8 

Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp 

565 570 575 

CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG ACC GCC 177 6 

15 Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala 

580 585 590 

GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AA GTAA 1815 
Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
20 595 600 



(2) INFORMATION FOR SEQ ID NO: 59: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 amino acids 

(B) TYPE: amino acid 

CO STP.AjoDEDJIESS ; single 
CD) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: protein 
Cv) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

35 

Met Ala Ala Ala Ala Ala Ala Gly Pro Glu Met Val Arg Gly Gin Val 

15 10 15 

Phe Asp Val Gly Pro Arg Tyr Thr Asn Leu Ser Tyr He Gly Glu Gly 

20 25 30 

40 Ala Tyr Gly Met Val Cys Ser Ala Tyr Asp Asn Leu Asn Lys Val Arg 

35 40 45 

Val Ala He Lys Lys He Ser Pro Phe Glu His Gin Thr Tyr Cys Gin 

50 55 60 

Arg Thr Leu Arg Glu He Lys He Leu Leu Arg Phe Arg His Glu Asn 
45 65 70 75 80 

He He Gly He Asn Asp He He Arg Ala Pro Thr He Glu Gin Met 

85 90 95 

Lys Asp Val Tyr He Val Gin Asp Leu Met Glu Thr Asp Leu Tvr T 



^4 0 



i-i . f; Arg Asp L^eu I.yn Pro Ser Asn Leu Leu Leu Asn Thr Thr Cys Asp 
55 145 15C 155 160 

Leu Lys He Cys Asp Phe Gly Leu Ala hrn '7.1^ a A^-^ ' - "' 
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165 170 175 

Asp His Thr Gly Phe Leu Thr Glu Tyr Val Ala Thr Arg Trp Tyr Arg 

180 185 190 

Ala Pro Glu He Met Leu Asn Ser Lys Gly Tyr Thr Lys Ser He Asp 
5 195 200 205 

He Trp Ser Val Gly Cys He Leu Ala Glu Met Leu Ser Asn Arg Pro 

210 215 220 

He Phe Pro Gly Lys His Tyr Leu Asp Gin Leu Asn His He Leu Gly 
225 230 235 240 

10 He Leu Gly Ser Pro Ser Gin Glu Asp Leu Asn Cys He He Asn Leu 

245 250 255 

Lys Ala Arg Asn Tyr Leu Leu Ser Leu Pro His Lys Asn Lys Val Pro 

260 265 270 

Trp Asn Arg Leu Phe Pro Asn Ala Asp Ser Lys Ala Leu Asp Leu Leu 
15 275 280 285 

Asp Lys Met Leu Thr Phe Asn Pro His Lys Arg He Glu Val Glu Gin 

290 295 300 

Ala Leu Ala His Pro Tyr Leu Glu Gin Tyr Tyr Asp Pro Ser Asp Glu 
305 310 315 320 

20 Pro He Ala Glu Ala Pro Phe Lys Phe Asp Met Glu Leu Asp Asp Leu 

325 330 335 

Pro Lys Glu Lys Leu Lys Glu Leu He Phe Glu Glu Thr Ala Arg Phe 

340 345 350 

Gin Pro Gly Tyr Arg Ser Met Asp Pro Pro Val Ala Thr Met Val Ser 
25 355 360 365 

Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu 

370 375 380 

Asp Gly Asp Val A.sn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu 
385 390 395 400 

30 Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr 

405 410 415 

Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr 

420 425 430 

Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp 
35 435 440 445 

Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He 

450 455 460 

Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe 
465 470 475 480 

40 Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp Phe 

485 490 495 

Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn 

500 505 510 

Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He Lys 
45 515 520 525 

Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu 

530 535 540 

Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp O'v P^- '.^n ^ ^ 
545 ^ ' ^ 



55 



^ Q P 
-J ^ w 



i\ 1 a 



595 



'hr Leu Gly Met Asp Glu Leu 

600 



Lv 
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(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2511 base pairs 

5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
10 (ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1 . , .2508 
(D) OTHER INFORMATION: 



I 0 



45 



xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



ATG GAG CTG GAA AAC ATC GTG GCC AAC ACG GTC TTG CTG AAA GCC AGG 4 8 

Met Glu Leu Glu Asn lie Val Ala Asn Thr Val Leu Leu Lys Ala Arg 
20 1 5 10 15 

GAA GGG GGC GGA GGA AAG CGC AAA GGG AAA AGC AAG AAG TGG AAA GAA 9 6 

Glu Gly Gly Gly Gly Lys Arg Lys Gly Lys Ser Lys Lys Trp Lys Glu 

20 25 30 

25 

ATC CTG AAG TTC CCT CAC ATT AGC CAG TGT GAA GAC CTC CGA AGG ACC 144 

lie Leu Lys Phe Pro His lie Ser Gin Cys Glu Asp Leu Arg Arg Thr 
35 40 45 

30 ATA GAC AGA GAT TAC TGC AGT TTA TGT GAC AAG CAG CCA ATC GGG AGG 192 

lie Asp Arg Asp Tyr Cys Ser Leu Cys Asp Lys Gin Pro lie Gly Arg 
50 55 60 

CTG CTT TTC CGG CAG TTT TGT GAA ACC AGG CCT GGG CTG GAG TGT TAC 24 0 

35 Leu Leu Phe Arg Gin Phe Cys Glu Thr Arg Pro Gly Leu Glu Cys Tyr 

65 70 75 80 

ATT CAG TTC CTG GAC TCC GTG GCA GAA TAT GAA GTT ACT CCA GAT GAA 28 8 

lie Gin Phe Leu Asp Ser Val Ala Glu Tyr Glu Val Thr Pro Asp Glu 
40 85 90 95 

AA;^. CTG GGA GAG AAA GGG AAG GAA ATT ATG ACC AAG TAC CTC ACC CCA 33 6 

Lys Leu Gly Glu Lys Gly Lys Glu lie Met Thr Lys Tyr Leu Thr Pro 

100 105 110 



AAG TCC CCT GTT TTC ATA GCC CAA GTT GGC CAA GAC CTG GTC TCC CAG 3 84 

Lys Ser Pro Val Phe lie Ala Gin Val Gly Gin Asp Leu Val Ser Gin 
115 120 125 



J. o - o * \_ L-M'^ ^MLi i M.'v^ V- J. Aoo vjv-Jrt Um-K i J. Lj/-v^ 

55 Cys Ala Gin Ser Val His Glu Tyr Leu Arg Gly Glu Pro Phe Kis Glu 
145 150 155 
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TAT CTG GAC AGC ATG TTT TTT GAG CGC TTT CTC GAG TGG AAG TGG TTG 52 8 

Tyr Leu Asp Ser Met Phe Phe Asp Arg Phe Leu Gin Trp Lys Trp Leu 

165 170 175 

5 

GAA AGG CAA CCG GTG ACC AAA AAC ACT TTC AGG GAG TAT CGA GTG CTA 576 
Glu Arg Gin Pro Val Thr Lys Asn Thr Phe Arg Gin Tyr Arg Val Leu 

180 185 190 



10 GGA AAA GGG GGC TTC GGG GAG GTG TGT GCC TGC GAG GTT CGG GCC AGG 624 
Gly Lys Gly Gly Phe Gly Glu Val Cys Ala Cys Gin Val Arg Ala Thr 
195 200 205 



GGT AAA ATG TAT GCC TGC AAG CUC TTG GAG AAG AAG AGG ATC AAA AAG 67 2 

15 Gly Lys Met Tyr Ala Cys Lys Arg Leu Glu Lys Lys Arg lie Lys Lys 
210 215 220 



AGG AAA GGG GAG TCC ATG GCC CTC AAT GAG AAG CAG ATC CTC GAG AAG 72 0 

Arg Lys Gly Glu Ser Met Ala Leu Asn Glu Lys Gin lie Leu Glu Lys 
20 225 230 235 240 



GTC AAC AGT CAG TTT GTG GTC AAC CTG GCC TAT GCC TAG GAG ACC AAG 768 

Val Asn Ser Gin Phe Val Val Asn Leu Ala Tyr Ala Tyr Glu Thr Lys 

245 250 255 

25 

GAT GCA CTG TGC TTG GTC CTG ACC ATC ATG AAT GGG GGT GAC CTG AAG 816 

Asp Ala Leu Cys Leu Val Leu Thr lie Met Asn Gly Gly Asp Leu Lys 

260 265 270 

30 TTC CAC ATC TAG AAC ATG GGC AAC CCT GGC TTC GAG GAG GAG CGG GCC 864 
Phe His lie Tyr Asn Met Gly Asn Pro Gly Phe Glu Glu Glu Arg Ala 
275 280 285 



TTG TTT TAT GCG GCA GAG ATC CTC TGC GGC TTA GAA GAC CTC CAC CGT 912 
35 Leu Phe Tyr Ala Ala Glu lie Leu Cys Gly Leu Glu Asp Leu His Arg 
290 295 300 



GAG AAC ACC GTC TAC CGA GAT CTG AAA CCT GAA AAC ATC CTG TTA GAT 9 60 

Glu Asn Thr Val Tyr Arg Asp Leu Lys Pro Glu Asn lie Leu Leu Asp 
40 305 310 315 320 



GAT TAT GGC CAC ATT AGG ATC TCA GAC CTG GGC TTG GCT GTG AAG ATC 10 08 

Asp Tyr Gly His lie Arg lie Ser Asp Leu Gly Leu Ala Val Lys lie 

325 330 335 

45 

CCC GAG GGA GAC CTG ATC CGC GGC CGG GTG GGC ACT GTT GGC TAC ATG 105 6 

Pro Glu Gly Asp Leu lie Arg Gly Arg Val Gly Thr Val Gly Tyr Met 

340 345 ^'^^ 



TGG GGC GTT GGC TGC CTC ATC TAT GAG ATG ATC GAG GGC CAG TCG CCG 
55 Trp Gly Leu Gly Cys Leu lie Tyr Glu Met He Glu Gly Gin Ser Pre 
370 T"^^ 
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TTC CGC GGC COT AAG GAG AAG GTG AAG CGG GAG GAG GTG GAG CGC CGG 12 0 0 

Phe Arg Gly Arg Lys Glu Lys Val Lys Arg Glu Glu Val Asp Arg Arg 

385 390 395 400 

5 

GTG CTG GAG ACG GAG GAG GTG TAG TCC CAC AAG TTC TCC GAG GAG GCC 124 8 

Val Leu Glu Thr Glu Glu Val Tyr Ser His Lys Phe Ser Glu Glu Ala 

405 410 415 

10 AAG TCC ATC TGC AAG ATG CTG CTC ACG AAA GAT GCG AAG CAG AGG CTG 12 9 6 

Lys Ser lie Cys Lys Met Leu Leu Thr Lys Asp Ala Lys Gin Arg Leu 

420 425 430 

GGC ran cag gag gag ggg gct gca gag gtc aag aga cac ccc ttc ttc 1344 

15 Gly Cys Gin Glu Glu Gly Ala Ala Glu Val Lys Arg His Pro Phe Phe 

435 440 445 

AGG AAC ATG AAC TTC AAG CGC TTA GAA GCC GGG ATG TTG GAC CCT CCC 13 92 
Arg Asn Met Asn Phe Lys Arg Leu Glu Ala Gly Met Leu Asp Pro Pro 
20 450 455 460 

TTC GTT CCA GAC CCC CGC GCT GTG TAC TGT AAG GAC GTG CTG GAC ATC 144 0 

Phe Val Pro Asp Pro Arg Ala Val Tyr Cys Lys Asp Val Leu Asp He 

465 470 475 480 

25 

GAG CAG TTC TCC ACT GTG AAG GGC GTC AAT CTG GAC CAC ACA GAC GAC 14 8 8 

Glu Gin Phe Ser Thr Val Lys Gly Val Asn Leu Asp His Thr Asp Asp 

485 490 495 

30 GAC TTC TAC TCC AAG TTC TCC ACG GGC TCT GTG TCC ATC CCA TGG CAA 15 3 6 
Asp Phe Tyr Ser Lys Phe Ser Thr Gly Ser Val Ser He Pro Trp Gin 

500 505 510 

AAC GAG ATG ATA GAA ACA GAA TGC TTT AAG GAG CTG AAC GTG TTT GGA 15 84 

35 Asn Glu Met He Glu Thr Glu Cys Phe Lys Glu Leu Asn Val Phe Gly 

515 520 525 

CCT AAT GGT ACC CTC CCG CCA GAT CTG AAC AGA AAC CAC CCT CCG GAA 163 2 

Pro Asn Gly Thr Leu Pro Pro Asp Leu Asn Arg Asn His Pro Pro Glu 
40 530 535 540 

CCG CCC AAG AAA GGG CTG CTC CAG AGA CTC TTC AAG CGG CAG CAT CAG 16 8 0 

Pro Pro Lys Lys Gly Leu Leu Gin Arg Leu Phe Lys Arg Gin His Gin 
545 550 555 560 



45 



AAC AAT TCC AAG AGT TCG CCC AGC TCC AAG ACC AGT TTT AAC CAC CAC 17 2 8 

Asn Asn Ser Lys Ser Ser Pro Ser Ser Lys Thr Ser Phe Asn His His 

565 57n -"^^ 



CCA CCG GTC CCC ACC ATG GTG AGC AAG GGC CAG GAG CTG TTC ACC GGG 1B24 
55 Pro Pro Val Ala Thr r^.et: Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 

5 95 (^^^r 
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GTG GTG CCC ATC CTG 
Val Val Pro lie Leu 
610 

5 

TTC AGC GTG TCC GGC 
Phe Ser Val Ser Gly 
625 

10 ACC CTG AAG TTC ATC 
Thr Leu Lys Phe lie 

645 



GTC GAG CTG GAC GGC GAC 
Val Glu Leu Asp Gly Asp 
615 

GAG GGC GAG GGC GAT GCC 
Glu Gly Glu Gly Asp Ala 
630 635 

TGC ACC ACC GGC AAG CTG 
Cys Thr Thr Gly Lys Leu 

650 



GTA AAC GGC CAC AAG 1872 

Val Asn Gly His Lys 

620 

ACC TAG GGC AAG CTG 192 0 
Thr Tyr Gly Lys Leu 

640 

CCC GTG CCC TGG CCC 1966 
Pro Val Pro Trp Pro 

655 



ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAG 2 016 

15 Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr 

660 665 670 



CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA 2 064 
Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 
20 675 6B0 685 



GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC 2112 

Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr 
690 695 700 

AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC 2160 

Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 

705 710 715 720 



30 ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG 22 08 

lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly 

725 730 735 



CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC 2 2 56 

35 His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala 

740 745 750 



GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC 2 3 04 

Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn 
40 755 760 765 



ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC 23 52 

lie Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 
770 775 780 

CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC 24 0 0 

Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 
785 790 79=^ 



GTC CT3 CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC 24 96 

55 Val Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp 

82 0 ^r- 
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GAG CTG TAG AAG TAA 2 511 

Glu Leu Tyr Lys 
835 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 836 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

15 (ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

20 Met Glu Leu Glu Asn He Val Ala Asn Thr Val Leu Leu Lys Ala Arg 
15 10 15 

Glu Gly Gly Gly Gly Lys Arg Lys Gly Lys Ser Lys Lys Trp Lys Glu 

20 25 30 

He Leu Lys Phe Pro His He Ser Gin Cys Glu Asp Leu Arg Arg Thr 
25 35 40 45 

He Asp Arg Asp Tyr Cys Ser Leu Cys Asp Lys Gin Pro He Gly Arg 

50 55 60 

Leu Leu Phe Arg Gin Phe Cys Glu Thr Arg Pro Gly Leu Glu Cys Tyr 
65 70 75 80 

30 He Gin Phe Leu Asp Ser Val Ala Glu Tyr Glu Val Thr Pro Asp Glu 

85 90 95 

Lys Leu Gly Glu Lys Gly Lys Glu He Met Thr Lys Tyr Leu Thr Pro 

100 105 110 

Lys Ser Pro Val Phe He Ala Gin Val Gly Gin Asp Leu Val Ser Gin 
35 115 120 125 

Thr Glu Glu Lys Leu Leu Gin Lys Pro Cys Lys Glu Leu Phe Ser Ala 

130 135 140 

Cys Ala Gin Ser Val His Glu Tyr Leu Arg Gly Glu Pro Phe His Glu 
145 150 155 160 

40 Tyr Leu Asp Ser Met Phe Phe Asp Arg Phe Leu Gin Trp Lys Trp Leu 

165 170 175 

Glu Arg Gin Pro Val Thr Lys Asn Thr Phe Arg Gin Tyr Arg Val Leu 

180 185 190 

Gly Lys Gly Gly Phe Gly Glu Val Cys Ala Cys Gin Val Arg Ala Thr 
45 195 200 205 

Gly Lys Met Tyr Ala Cys Lys Arg Leu Glu Lys Lys Arg He Lys Lys 

210 215 220 

Arg Lys Gly Glu Ser Met Ala Leu Asn Glu Lys Gin Tie Leu ^-V: '^vr 
225 



^ 



b 



lie Tyr Asn MpV Gly Ann Pro !",ly 
55 275 280 

Leu Phe Tyr Ala Ala Glu He Leu C\'n T.^ 



Pne G 



' u G 1 u Glu Arg A \. a 
285 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 



PCT/T>K98/00145 



132 

290 295 300 

Glu Asn Thr Val Tyr Arg Asp Leu Lys Pro Glu Asn lie Leu Leu Asp 
305 310 315 320 

Asp Tyr Gly His lie Arg lie Ser Asp Leu Gly Leu Ala Val Lys lie 
5 325 330 335 

Pro Glu Gly Asp Leu lie Arg Gly Arg Val Gly Thr Val Gly Tyr Met 

340 345 350 

Ala Pro Glu Val Leu Asn Asn Gin Arg Tyr Gly Leu Ser Pro Asp Tyr 
355 360 365 

10 Trp Gly Leu Gly Cys Leu lie Tyr Glu Met lie Glu Gly Gin Ser Pro 
370 375 380 

Phe Arg Gly Arg Lys Glu Lys Val Lys Arg Glu Glu Val Asp Arg Arg 
385 390 395 400 

Val Leu Glu Thr Glu Glu Val Tyr Ser Kia Lys Phe Ser Glu Glu Ala 
15 405 410 415 

Lys Ser lie Cys Lys Met Leu Leu Thr Lys Asp Ala Lys Gin Arg Leu 

420 425 430 

Gly Cys Gin Glu Glu Gly Ala Ala Glu Val Lys Arg His Pro Phe Phe 
435 440 445 

20 Arg Asn Met Asn Phe Lys Arg Leu Glu Ala Gly Met Leu Asp Pro Pro 
450 455 460 

Phe Val Pro Asp Pro Arg Ala Val Tyr Cys Lys Asp Val Leu Asp lie 
465 470 475 480 

Glu Gin Phe Ser Thr Val Lys Gly Val Asn Leu Asp His Thr Asp Asp 
25 485 490 495 

Asp Phe Tyr Ser Lys Phe Ser Thr Gly Ser Val Ser lie Pro Trp Gin 

500 505 510 

Asn Glu Met lie Glu Thr Glu Cys Phe Lys Glu Leu Asn Val Phe Gly 
515 520 525 

30 Pro Asn Gly Thr Leu Pro Pro Asp Leu Asn Arg Asn His Pro Pro Glu 
530 535 540 

Pro Pro Lys Lys Gly Leu Leu Gin Arg Leu Phe Lys Arg Gin His Gin 
545 550 555 560 

Asn Asn Ser Lys Ser Ser Pro Ser Ser Lys Thr Ser Phe Asn His His 
35 565 570 575 

lie Asn Ser Asn His Val Ser Ser Asn Ser Thr Gly Ser Ser Arg Asp 

580 585 590 

Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 
595 600 605 

40 Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 
610 615 620 

Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
625 630 635 640 

Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 
45 645 650 655 

Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr 

660 665 670 

Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro GV: 
6 7 s ^ 



^^e u Leu Lys Gl'/ He Asp Fhe Lys Glu Asp Gly Asn lie Leu Gly 
55 725 730 735 

His LyB Leu Glu Tyr Asn Tyr Asn Ser His Asn '/al T\-r 
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740 745 750 

Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn 

755 760 765 

lie Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 
5 770 775 780 

Pro lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 
785 790 795 800 

Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 

805 810 815 

10 Val Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp 

820 825 830 

Glu Leu Tyr Lys 
835 

15 (2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1893 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



25 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . . 1890 
(D) OTHER INFORMATION: 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 

ATG AGC AGA AGC AAG CGT GAG AAC AAT TTT TAT AGT GTA GAG ATT GGA 4 8 

Met Ser Arg Ser Lys Arg Asp Asn Asn Phe Tyr Ser Val Glu lie Gly 

15 10 15 

35 

GAT TCT ACA TTC ACA GTC CTG AAA CGA TAT CAG AAT TTA AAA CCT ATA 96 

Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr Gin Asn Leu Lys Pro lie 

20 25 30 

40 GGC TCA GGA GCT CAA GGA ATA GTA TGC GCA GCT TAT GAT GCC ATT CTT 14 4 

Gly Ser Gly Ala Gin Gly He Val Cys Ala Ala Tyr Asp Ala He Leu 
35 40 45 

GAA AGA AAT GTT GCA ATC AAG AAG CTA AGC CGA CCA TTT CAG AAT CAG 19 2 

45 Glu Arg Asn Val Ala He Lys Lys Leu Ser Arg Pro Phe Gin Asn Gin 
50 55 60 

ACT CAT GCC AAG CGG GCC TAG AGA GAG CTA GTT CTT ATG AAA TGT ^^'^^ 

Thr Hin A"! -i t v-r- r. • 



B 5 



90 



l-::e Thr Pro Gin Ir/s 

95 



55 



TA G/\A GIkA TTT CAA GAT 



A. 
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Ser Leu Glu Glu Phe Gin Asp Val Tyr lie Val Met Glu Leu Met Asp 

100 105 110 



GCA AAT CTT TGC CAA GTG ATT CAG ATG GAG CTA GAT CAT GAA AGA ATG 
Ala Asn Leu Cys Gin Val lie Gin Met Glu Leu Asp His Glu Arg Met 
115 120 125 



384 



10 



TCC TAG CTT CTC TAT CAG ATG CTG TGT GGA ATG AAG CAC CTT CAT TCT 
Ser Tyr Leu Leu Tyr Gin Met Leu Cys Gly He Lys His Leu His Ser 
130 135 140 



432 



lb 



GCT GGA ATT ATT CAT CGG GAG TTA AAG CCC AGT AAT ATA GTA GTA AAA 

Ala Gly He He His Arg Asp Leu Lys Pro Ser Asn He Val Val Lys 
145 1^0 155 160 

TCT GAT TGC ACT TTG AAG ATT CTT GAC TTC GGT CTG GCC AGG ACT GCA 

Ser Asp Cys Thr Leu Lys He Leu Asp Phe Gly Leu Ala Arg Thr Ala 

165 170 175 



480 



528 



20 GGA ACG AGT TTT ATG ATG ACG CCT TAT GTA GTG ACT CGC TAC TAC AGA 
Gly Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr Tyr Arg 

180 185 190 



576 



GCA CCC GAG GTC ATC CTT GGC ATG GGC TAC AAG GAA AAC GTG GAT TTA 
25 Ala Pro Glu Val He Leu Gly Met Gly Tyr Lys Glu Asn Val Asp Leu 

195 200 205 



624 



30 



TGG TCT GTG GGG TGC A.TT ATG GGA GAA ATG GTT TGC CAC AAA ATC CTC 
Trp Ser Val Gly Cys He Met Gly Glu Met Val Cys His Lys He Leu 
210 215 220 



672 



35 



TTT CCA GGA AGG GAC TAT ATT GAT CAG TGG AAT AAA GTT ATT GAA CAG 

Phe Pro Gly Arg Asp Tyr He Asp Gin Trp Asn Lys Val He Glu Gin 

225 230 235 240 

CTT GGA ACA CCA TGT CCT GAA TTC ATG AAG AAA CTG CAA CCA ACA GTA 

Leu Gly Thr Pro Cys Pro Glu Phe Met Lys Lys Leu Gin Pro Thr Val 

245 250 255 



720 



768 



40 AGG ACT TAC GTT GAA AAC AGA CCT AAA TAT GCT GGA TAT AGC TTT GAG 
Arg Thr Tyr Val Glu Asn Arg Pro Lys Tyr Ala Gly Tyr Ser Phe Glu 

260 265 270 



816 



AAA CTC TTC CCT GAT GTC CTT TTC CCA GCT GAC TCA GAA CAC AAC AAA 
45 Lys Leu Phe Pro Asp Val Leu Phe Pro Ala Asp Ser Glu His Asn Lys 

275 280 285 



864 



CTT AAA GCC AGT CAG GCA AGG GAT TTG TTA TCC AAA ATG CTG GTA ATA 

Leu Lvs Ala ^i- - ' ' — ^ 



55 



r-.;;;.. ^ . ... /^^a i.eu G ^ n His ;^ro Tvr 



ATC AAT GTC 



3 10 



G TAT GAT CC 



315 



3 20 



TCT GAA GCA G 
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lie Asn Val Trp Tyr Asp Pro Ser Glu Ala Glu Ala Pro Pro Pro Lys 

325 330 335 

ATC CCT GAG AAG CAG TTA GAT GAA AGG GAA CAC ACA ATA GAA GAG TGG 105 6 

5 He Pro Asp Lys Gin Leu Asp Glu Arg Glu His Thr He Glu Glu Trp 

340 345 350 

AAA GAA TTG ATA TAT AAG GAA GTT ATG GAC TTG GAG GAG AG A ACC AAG 1104 
Lys Glu Leu He Tyr Lys Glu Val Met Asp Leu Glu Glu Arg Thr Lys 
10 355 360 365 

AAT GGA GTT ATA CGG GGG CAG CCC TCT CCT TTA GCA CAG GTG CAG CAG 115 2 

Asn Gly Val He Arg Gly Gin Pro Ser Pro Leu Ala Gin Val Gin Gin 
370 375 380 

15 

TGG GAT CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC 120 0 

Trp Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe 
385 390 395 400 

20 ACC GGG GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC 124 6 
Thr Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly 

405 410 415 

CAC AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC 12 96 
25 His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly 

420 425 430 

AAG CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC 1344 
Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro 
30 435 440 445 

TGG CCC ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC 13 92 

Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser 

450 455 460 

35 

CGC TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG 14 4 0 

Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met 

465 470 475 480 

40 CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC 14 8 8 

Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly 

485 490 495 

AAC TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG 15 3 6 

45 Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val 

500 505 510 

AAC CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC AT^ ^ ^ ^ ' 

Asn Ara Tie v ' 



55 



5 3 C 



5 3 5 



As : . e : n is As a ^ . y r x 
54 0 



ATG GCC GAC AAG CAG Aj\G AAC GGC ATC AAG G^G AAC 



1 ^1 ^ ^ 
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Met Ala Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys He Arg 
545 550 555 560 

CAC AAC ATC GAG GAG GGC AGC GTG GAG CTC GCC GAC CAC TAG CAG GAG 172 8 
5 His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin 

565 570 575 

AAC ACC GCC ATC GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAG 177 6 

Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr 
10 580 585 590 

CTG AGC ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT 1824 

Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp 
595 500 605 

15 

CAC ATG GTG CTG CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC 18 72 

His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly 
610 615 620 

20 ATG GAC GAG CTG TAG AAG TAA 18 93 

Met Asp Glu Leu Tyr Lys 
625 630 



25 (2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 0 amino acids 

(B) TYPE: amino acid 

30 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 



Met 


Ser 


Arg 


Ser 


Lys 


Arg 


Asp 


Asn 


Asn 


Phe 


Tyr 


Ser 


Val 


Glu 


He 


Gly 


1 








5 










10 










15 




Asp 


Ser 


Thr 


Phe 


Thr 


Val 


Leu 


Lys 


Arg 


Tyr 


Gin 


Asn 


Leu 


Lys 


Pro 


He 








20 










25 










30 






Gly 


Ser 


Gly 


Ala 


Gin 


Gly 


He 


Val 


Cys 


Ala 


Ala 


Tyr 


Asp 


Ala 


He 


Leu 






35 










40 










45 








Glu 


Arg 


Asn 


Val 


Ala 


He 


Lys 


Lys 


Leu 


Ser 


Arg 


Pro 


Phe 


Gin 


Asn 


Gin 




50 










55 










60 










Thr 


His 


Ala 


Lys 


Arg 


Ala 


Tyr 


Arg 


Glu 


Leu 


Val 


Leu 


Met 


Lys 


Cys 


Val 


65 










70 










75 










80 


Asn 


His 


Lys 


Asn 


He 


He 


Gly 


Leu 


Leu 


Asn 


Val 


Phe 











X z ^ 



oer Tyr Leu Leu Tyr Gin Met: Leu CyG Gly He Lys His Leu His Ser 
55 130 135 140 

Ala Gly He He His Arq Asp Le\: Lvr Pro S^r Am ^ ^ ------- 
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145 150 155 160 

Ser Asp Cys Thr Leu Lys He Leu Asp Phe Gly Leu Ala Arg Thr Ala 

165 170 175 

Gly Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr Tyr Arg 
5 180 165 190 

Ala Pro Glu Val He Leu Gly Met Gly Tyr Lys Glu Asn Val Asp Leu 

195 200 205 

Trp Ser Val Gly Cys He Met Gly Glu Met Val Cys His Lys He Leu 
210 215 220 

10 Phe Pro Gly Arg Asp Tyr He Asp Gin Trp Asn Lys Val He Glu Gin 
225 230 235 240 

Leu Gly Thr Pro Cys Pro Glu Phe Met Lys Lys Leu Gin Pro Thr Val 

245 250 255 

Arg Thr Tyr Val Glu Asn Arg Pro Lys Tyr Ala Gly Tyr Ser Phe Glu 
15 260 265 270 

Lys Leu Phe Pro Asp Val Leu Phe Pro Ala Asp Ser Glu His Asn Lys 

275 280 285 

Leu Lys Ala Ser Gin Ala Arg Asp Leu Leu Ser Lys Met Leu Val He 
290 295 300 

20 Asp Ala Ser Lys Arg He Ser Val Asp Glu Ala Leu Gin His Pro Tyr 
305 310 315 320 

He Asn Val Trp Tyr Asp Pro Ser Glu Ala Glu Ala Pro Pro Pro Lys 

325 330 335 

He Pro Asp Lys Gin Leu Asp Glu Arg Glu His Thr He Glu Glu Trp 
25 340 345 350 

Lys Glu Leu He Tyr Lys Glu Val Met Asp Leu Glu Glu Arg Thr Lys 

355 360 365 

Asn Gly Val He Arg Gly Gin Pro Ser Pro Leu Ala Gin Val Gin Gin 
370 375 380 

30 Trp Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe 
385 390 3 95 400 

Thr Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly 

405 410 415 

His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly 
35 420 425 430 

Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro 

435 440 445 

Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser 
450 455 460 

40 Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met 
465 470 475 480 

Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly 

485 490 495 

Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val 
^5 500 505 510 

Asn Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He 

515 520 525 

Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val '^'.'^ 
530 ' ^ ^ 



55 



r^^::; :"::r Pro lie Glv Asp Gly Pro Val Leu Leu Pro Asp Asn Kis Tyr 

580 585 590 

Leu Ser Thr Gin Ser Ala Leu Ser Irvr. Ago P^r^ A^r^ ■ ■ - ' ■ ' ■ 
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595 600 605 

His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly 

610 615 620 

Met Asp Glu Leu Tyr Lys 
5 625 630 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 1821 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1818 
20 (D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

ATG TCT CAG GAG AGG CCC ACG TTC TAG CGG CAG GAG CTG AAC AAG ACA 4 8 

25 Met Ser Gin Glu Arg Pro Thr Phe Tyr Arg Gin Glu Leu Asn Lys Thr 
15 10 15 

ATC TGG GAG GTG CCC GAG CGT TAG CAG AAC CTG TCT CCA GTG GGC TCT 96 
He Trp Glu Val Pro Glu Arg Tyr Gin Asn Leu Ser Pro Val Gly Ser 
30 20 25 30 

GGC GCC TAT GGC TCT GTG TGT GCT GCT TTT GAC ACA AAA ACG GGG TTA 144 

Gly Ala Tyr Gly Ser Val Cys Ala Ala Phe Asp Thr Lys Thr Gly Leu 
35 40 45 

35 

CGT GTG GCA GTG AAG AAG CTC TCC AGA CCA TTT CAG TCC ATC ATT CAT 192 

Arg Val Ala Val Lys Lys Leu Ser Arg Pro Phe Gin Ser He He His 
50 55 60 

40 GCG AAA AGA ACC TAG AGA GAA CTG CGG TTA CTT AAA CAT ATG AAA CAT 24 0 

Ala Lys Arg Thr Tyr Arg Glu Leu Arg Leu Leu Lys His Met Lys His 
65 70 75 80 

GAA AAT GTG ATT GGT CTG TTG GAC GTT TTT ACA CCT GCA AGG TCT CTG 2 88 

45 Glu Asn Val He Gly Leu Leu Asp Val Phe Thr Pro Ala Arg Ser Leu 

85 90 95 

GAG GAA TTC AAT GAT GTG TAT CTG GTG ACC CAT CTC ATG GGG GCA GAT 



55 



TTC 



115 12 0 

ATC TAC CAA ATT CTC CGA GGT CTA AAG TAT 



A. 

125 



ATA 
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Phe Leu lie Tyr Gin lie Leu Arg Gly Leu Lys Tyr lie His Ser Ala 
130 135 140 

GAG ATA ATT CAC AGG GAG CTA AAA CCT AGT AAT CTA GCT GTG AAT GAA 48 0 

5 Asp lie lie His Arg Asp Leu Lys Pro Ser Asn Leu Ala Val Asn Glu 
145 150 155 160 

GAG TGT GAG CTG AAG ATT CTG GAT TTT GGA CTG GCT CGG CAC ACA GAT 52 8 

Asp Cys Glu Leu Lys lie Leu Asp Phe Gly Leu Ala Arg His Thr Asp 
10 165 170 175 

GAT GAA ATG ACA GGC TAG GTG GCC ACT AGG TGG TAG AGG GCT CCT GAG 57 6 

Asp Glu Met Thr Gly Tyr Val Ala Thr Arg Trp Tyr Arg Ala Pro Glu 

180 185 190 

15 

ATC ATG CTG AAC TGG ATG CAT TAG AAG GAG ACA GTT GAT ATT TGG TCA 624 

lie Met Leu Asn Trp Met His Tyr Asn Gin Thr Val Asp lie Trp Ser 
195 200 205 

20 GTG GGA TGC ATA ATG GCC GAG CTG TTG ACT GGA AGA ACA TTG TTT CCT 672 
Val Gly Cys lie Met Ala Glu Leu Leu Thr Gly Arg Thr Leu Phe Pro 
210 215 220 

GGT ACA GAC CAT ATT GAT CAG TTG AAG CTC ATT TTA AGA CTC GTT GGA 72 0 

25 Gly Thr Asp His He Asp Gin Leu Lys Leu lie Leu Arg Leu Val Gly 
225 230 235 240 

ACC CCA GGG GCT GAG GTT TTG AAG AA_A ATC TCC TCA GAG TCT GCA AGA 768 
Thr Pro Gly Ala Glu Leu Leu Lys Lys He Ser Ser Glu Ser Ala Arg 
30 245 250 255 

AAC TAT ATT CAG TCT TTG ACT CAG ATG CCG AAG ATG AAC TTT GCG AAT 816 

Asn Tyr He Gin Ser Leu Thr Gin Met Pro Lys Met Asn Phe Ala Asn 

260 265 270 

35 

GTA TTT ATT GGT GCC AAT CCC CTG GCT GTC GAC TTG CTG GAG AAG ATG 8 64 

Val Phe He Gly Ala Asn Pro Leu Ala Val Asp Leu Leu Glu Lys Met 
275 280 285 

40 CTT GTA TTG GAC TCA GAT AAG AGA ATT ACA GCG GCC CAA GCC CTT GCA 912 
Leu Val Leu Asp Ser Asp Lys Arg He Thr Ala Ala Gin Ala Leu Ala 
290 295 300 

CAT GCC TAG TTT GCT CAG TAG CAC GAT CCT GAT GAT GAA CCA GTG GCC 960 
45 His Ala Tyr Phe Ala Gin Tyr His Asp Pro Asp Asp Glu Pro Val Ala 
305 310 315 320 

GAT CCT TAT GAT CAG TCC TTT GAA AGC AGG GAC CTC CTT ATA GAT GAG lOOP 
Asp Pro Tyr Asp Gin Ser Ph*^ ^-Vi ^o- :^ ^--^ - ^ 



34 



350 



CTT GAC CAA GAA GAG AT 



GAT CCA 



\^ o 
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Leu Asp Gin Glu Glu Met Glu Ser Glu Asp Pro Pro Val Ala Thr Met 
355 360 365 

GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC 115 2 

5 Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
370 375 380 

GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG 12 0 0 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
10 385 390 395 400 

GGC GAG GGC GAT GCC ACC TAG GGC AAG CTG ACC CTG AAG TTC ATC TGC 12 4 8 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 

405 410 415 

15 

ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG 12 96 
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 

420 425 430 

20 ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG CAG 13 44 
Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
435 440 445 

CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC 13 92 
25 His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
450 455 460 

ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG 144 0 

Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
30 465 470 475 480 

AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC 14 8 8 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie 

485 490 495 

35 

GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC AAC 153 6 

Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn 

500 505 510 

40 TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC 15 84 

Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly 
515 520 525 

ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG 163 2 
45 lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser Val 
530 535 540 

CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GA^ G^^ ' 

Gin T ! V A ^ r. 



. .:e:. J., n Ser Ala Leu Ser 

565 57 C 575 



55 



AAA GAC CCC AAC GAG r^J\G CGC GAT CAC 
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Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 

580 585 590 

ACC GCC GCC GGG ATC ACT CTC GGC ATG GAG GAG CTG TAG AAG TAA 1821 

Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
595 600 605 



(2) INFORMATION FOR SEQ ID NO: 65: 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 606 amino acids 

(B) TYPE: amino acid 

(C) STRANDRDNESS : single 

15 {D} TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

20 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 65 : 

Met Ser Gin Glu Arg Pro Thr Phe Tyr Arg Gin Glu Leu Asn Lys Thr 

15 10 15 

lie Trp Glu Val Pro Glu Arg Tyr Gin Asn Leu Ser Pro Val Gly Ser 
25 20 25 30 

Gly Ala Tyr Gly Ser Val Cys Ala Ala Phe Asp Thr Lys Thr Gly Leu 

35 40 45 

Arg Val Ala Val Lys Lys Leu Ser Arg Pro Phe Gin Ser lie lie His 
50 55 60 

30 Ala Lys Arg Thr Tyr Arg Glu Leu Arg Leu Leu Lys His Met Lys His 
65 70 75 80 

Glu Asn Val lie Gly Leu Leu Asp Val Phe Thr Pro Ala Arg Ser Leu 

85 90 95 

Glu Glu Phe Asn Asp Val Tyr Leu Val Thr His Leu Met Gly Ala Asp 
35 100 105 110 

Leu Asn Asn lie Val Lys Cys Gin Lys Leu Thr Asp Asp His Val Gin 

115 120 125 

Phe Leu lie Tyr Gin lie Leu Arg Gly Leu Lys Tyr lie His Ser Ala 
130 135 140 

40 Asp lie lie His Arg Asp Leu Lys Pro Ser Asn Leu Ala Val Asn Glu 
145 150 155 160 

Asp Cys Glu Leu Lys lie Leu Asp Phe Gly Leu Ala Arg His Thr Asp 

165 170 175 

Asp Glu Met Thr Gly Tyr Val Ala Thr Arg Trp Tyr Arg Ala Pro Glu 
45 180 185 190 

lie Met Leu Asn Trp Met His Tyr Asn Gin Thr Val Asp lie Trp Ser 

195 200 205 

Val Gly Cys lie Met Ala Glu Leu Leu Thr Gly Ara Thr I.pu Ph^ Pr- 
210 ^ ' 



^ 5 o 



2' 



Gin Ser Leu Thr i^ln ^et Pro Lvg Met Acn Phe Ala Ann 
55 260 265 270 

Val Phe He Gly Ala Asn Pre Leu Ala 
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275 280 285 

Leu Val Leu Asp Ser Asp Lys Arg He Thr Ala Ala Gin Ala Leu Ala 

290 295 300 

His Ala Tyr Phe Ala Gin Tyr His Asp Pro Asp Asp Glu Pro Val Ala 
5 305 310 315 320 

Asp Pro Tyr Asp Gin Ser Phe Glu Ser Arg Asp Leu Leu He Asp Glu 

325 330 335 

Trp Lys Ser Leu Thr Tyr Asp Glu Val He Ser Phe Val Pro Pro Pro 

340 345 350 

10 Leu Asp Gin Glu Glu Met Glu Ser Glu Asp Pro Pro Val Ala Thr Met 

355 360 365 

Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 

370 375 380 

Glu Leu Asp Gly Asp v^il Asn Gly Kia Lys Phe Ser Val Ser Gly Glu 
15 385 390 395 400 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 

405 410 415 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 

420 425 430 

20 Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 

435 440 445 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 

450 455 460 

Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
25 465 470 475 480 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 

485 490 495 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 

5O0 505 510 

30 Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 

515 520 525 

He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 

530 535 540 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
35 545 550 555 560 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 

565 570 575 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 

580 585 590 

40 Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 

595 600 605 

(2) INFORMATION FOR SEQ ID NO: 66: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2913 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 



.A, ^CAME/KKY: Coding Sequence 
55 (E) LCCATION: 1...2 910 

(D) OTHER INFORMATION: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

ATG AGT GCT GAG GGG TAC CAG TAG AGA GCG CTG TAT GAT TAT AAA AAG 4 8 

5 Met Ser Ala Glu Gly Tyr Gin Tyr Arg Ala Leu Tyr Asp Tyr Lys Lys 
15 10 15 

GAA AGA GAA GAA GAT ATT GAC TTG CAC TTG GGT GAC ATA TTG ACT GTG 96 
Glu Arg Glu Glu Asp lie Asp Leu His Leu Gly Asp lie Leu Thr Val 
10 20 25 30 

AAT AAA GGG TCC TTA GTA GCT CTT GGA TTC AGT GAT GGA CAG GAA GCC 14 4 

Asn Lys Gly Ser Leu Val Ala Leu Gly Phe Ser Asp Gly Gin Glu Ala 
35 40 45 

15 

AGG OCT GAA GAA ATT GGC TGG TTA AAT GGC TAT AAT GAA ACC ACA GGG 192 
Arg Pro Glu Glu lie Gly Trp Leu Asn Gly Tyr Asn Glu Thr Thr Gly 
50 55 60 

20 GAA AGG GGG GAC TTT CCG GGA ACT TAC GTA GAA TAT ATT GGA AGG AAA 24 0 

Glu Arg Gly Asp Phe Pro Gly Thr Tyr Val Glu Tyr lie Gly Arg Lys 
65 70 75 80 

AAA ATC TCG CCT CCC ACA CCA AAG CCC CGG CCA CCT CGG CCT CTT CCT 28 8 

25 Lys lie Ser Pro Pro Thr Pro Lys Pro Arg Pro Pro Arg Pro Leu Pro 

85 90 95 

GTT GCA CCA GGT TCT TCG AAA ACT GAA GCA GAT GTT GAA CAA CAA GCT 33 6 

Val Ala Pro Gly Ser Ser Lys Thr Glu Ala Asp Val Glu Gin Gin Ala 
30 100 105 110 

TTG ACT CTC CCG GAT CTT GCA GAG CAG TTT GCC CCT CCT GAC ATT GCC 3 84 

Leu Thr Leu Pro Asp Leu Ala Glu Gin Phe Ala Pro Pro Asp lie Ala 
115 120 125 

35 

CCG CCT CTT CTT ATC AAG CTC GTG GAA GCC ATT GAA AAG AAA GGT CTG 4 32 

Pro Pro Leu Leu lie Lys Leu Val Glu Ala lie Glu Lys Lys Gly Leu 
130 135 140 

40 GAA TGT TCA ACT CTA TAC AGA ACA CAG AGC TCC AGC AAC CTG GCA GAA 4 80 

Glu Cys Ser Thr Leu Tyr Arg Thr Gin Ser Ser Ser Asn Leu Ala Glu 
145 150 155 160 

TTA CGA CAG CTT CTT GAT TGT GAT ACA CCC TCC GTG GAC TTG GAA ATG 52 8 

45 Leu Arg Gin Leu Leu Asp Cys Asp Thr Pro Ser Val Asp Leu Glu Met 

165 170 175 

ATC GAT GTG CAC GTT TTG GCT GAC GCT TTC AAA CGC TAT TTC CT^. ^7.^ 



. . ■■ ; : /iia '/l^^ "I v r '^tti Giu Me ^ e Ser 

19 5 2C0 205 

55 

TTA GCT CCA GAA GTA CAA AGC TCC lAA ^-AA TA"" A""'" '"A^ ' '■ 
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Leu Ala Pro Glu Val Gin Ser Ser Glu Glu Tyr lie Gin Leu Leu Lys 
210 215 220 



AAG CTT ATT AGG TCG CCT AGC ATA CCT CAT CAG TAT TGG CTT ACG CTT 
Lys Leu lie Arg Ser Pro Ser He Pro His Gin Tyr Trp Leu Thr Leu 
225 230 235 240 



720 



10 



CAG TAT TTG TTA AAA CAT TTC TTC AAG CTC TCT CAA ACC TCC AGC AAA 
Gin Tyr Leu Leu Lys His Phe Phe Lys Leu Ser Gin Thr Ser Ser Lys 

245 250 255 



768 



15 



AAT CTG TTG AAT GCA AGA GTA CTC TCT GAA ATT TTC AGC CCT ATG CTT 
Asn Leu Leu Asn Ala Arg Val Leu Ser Glu He Phe Spr Pre Met Leu 

265 270 



r- r\ 

^ u u 



TTC AGA TTC TCA GCA GCC AGC TCT GAT AAT ACT GAA AAC CTC ATA AAA 
Phe Arg Phe Ser Ala Ala Ser Ser Asp Asn Thr Glu Asn Leu He Lys 
275 280 285 



816 



864 



20 GTT ATA GAA ATT TTA ATC TCA ACT GAA TGG AAT GAA CGA CAG CCT GCA 
Val lie Glu He Leu He Ser Thr Glu Trp Asn Glu Arg Gin Pro Ala 
290 295 300 



912 



CCA GCA CTG CCT CCT AAA CCA CCA AAA CCT ACT ACT GTA GCC AAC AAC 
25 Pro Ala Leu Pro Pro Lys Pro Pro Lys Pro Thr Thr Val Ala Asn Asn 
305 310 315 320 



960 



30 



GGT ATG AAT AAC AAT ATG TCC TTA CAA AAT GCT GAA TGG TAG TGG GGA 
Gly Met Asn Asn Asn Met Ser Leu Gin Asn Ala Glu Trp Tyr Trp Gly 

325 330 335 



1008 



35 



GAT ATC TCG AGG GAA GAA GTG AAT GAA AAA CTT CGA GAT ACA GCA GAC 
Asp He Ser Arg Glu Glu Val Asn Glu Lys Leu Arg Asp Thr Ala Asp 

340 345 350 

GGG ACC TTT TTG GTA CGA GAT GCG TCT ACT AAA ATG CAT GGT GAT TAT 
Gly Thr Phe Leu Val Arg Asp Ala Ser Thr Lys Met His Gly Asp Tyr 
355 360 365 



1056 



1104 



40 ACT CTT ACA CTA AGG AAA GGG GGA AAT AAC AAA TTA ATC AAA ATA TTT 
Thr Leu Thr Leu Arg Lys Gly Gly Asn Asn Lys Leu He Lys He Phe 
370 375 380 



1152 



CAT CGA GAT GGG AAA TAT GGC TTC TCT GAC CCA TTA ACC TTC AGT TCT 
45 His Arg Asp Gly Lys Tyr Gly Phe Ser Asp Pro Leu Thr Phe Ser Ser 
385 390 395 400 



1200 



GTG GTT GAA TTA ATA AAC CAC TAC COG AAT riA^ 



55 



420 



425 



Val Ser Lys Tyr G 
430 
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Gin Asp Gin Val Val Lys Glu Asp Asn He Glu Ala Val Gly Lys Lys 
435 440 445 

TTA CAT GAA TAT AAC ACT CAG TTT CAA GAA AAA AGT CGA GAA TAT GAT 13 92 
5 Leu His Glu Tyr Asn Thr Gin Phe Gin Glu Lys Ser Arg Glu Tyr Asp 
450 455 460 

AGA TTA TAT GAA GAA TAT ACC CGC ACA TCC CAG GAA ATC CAA ATG AAA 14 4 0 

Arg Leu Tyr Glu Glu Tyr Thr Arg Thr Ser Gin Glu He Gin Met Lys 
10 465 470 475 480 

AGG ACA GCT ATT GAA GCA TTT AAT GAA ACC ATA AAA ATA TTT GAA GAA 14 8 8 

Arg Thr Ala He Glu Ala Phe Asn Glu Thr He Lys He Phe Glu Glu 

4R5 490 495 

15 

CAG TGC CAG ACC CAA GAG CGG TAC AGC AAA GAA TAG ATA GAA AAG TTT 15 3 6 

Gin Cys Gin Thr Gin Glu Arg Tyr Ser Lys Glu Tyr He Glu Lys Phe 

500 505 510 

20 AAA CGT GAA GGC AAT GAG AAA GAA ATA CAA AGG ATT ATG CAT AAT TAT 15 84 

Lys Arg Glu Gly Asn Glu Lys Glu He Gin Arg He Met His Asn Tyr 
515 520 525 

GAT AAG TTG AAG TCT CGA ATC AGT GAA ATT ATT GAC AGT AGA AGA AGA 163 2 

25 Asp Lys Leu Lys Ser Arg He Ser Glu He He Asp Ser Arg Arg Arg 
530 535 540 

TTG GAA GAA GAC TTG AAG AAG CAG GCA GCT GAG TAT CGA GAA ATT GAC 1680 
Leu Glu Glu Asp Leu Lys Lys Gin Ala Ala Glu Tyr Arg Glu He Asp 
30 545 550 555 560 

AAA CGT ATG AAC AGC ATT AAA CCA GAC CTT ATC CAG CTG AGA AAG ACG 172 8 

Lys Arg Met Asn Ser He Lys Pro Asp Leu He Gin Leu Arg Lys Thr 

565 570 575 

35 

AGA GAC CAA TAC TTG ATG TGG TTG ACT CAA AAA GGT GTT CGG CAA AAG 17 7 6 

Arg Asp Gin Tyr Leu Met Trp Leu Thr Gin Lys Gly Val Arg Gin Lys 

580 585 590 

40 AAG TTG AAC GAG TGG TTG GGC AAT GAA AAC ACT GAA GAC CAA TAT TCA 18 24 

Lys Leu Asn Glu Trp Leu Gly Asn Glu Asn Thr Glu Asp Gin Tyr Ser 
595 600 605 

CTG GTG GAA GAT GAT GAA GAT TTG CCC CAT CAT GAT GAG AAG ACA TGG 18 72 

45 Leu Val Glu Asp Asp Glu Asp Leu Pro His His Asp Glu Lys Thr Trp 
610 615 620 

AAT GTT GGA AGC AGC AAC CGA AAC AAA GCT GAA AAC CT^^ T^^- ^"^^ 
Asn Val Glv ^'-^^ * 



■ : . ■- ........ . .n - V. c: bei l.ys (.3 1m Giy C\' 

645 650 655 

55 

lI\L lO^L. lUv„ iv-^i '^liA O-^O ^^'O ^^A^ ..-A^A-. .7..'^ ^ 
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Tyr Ala Cys Ser Val Val Val Asp Gly Glu Val Lys His Cys Val He 

660 665 670 

AAC AAA ACA OCA ACT GGC TAT GGC TTT GCC GAG CCC TAT AAC TTG TAG 2 0 64 
5 Asn Lys Thr Ala Thr Gly Tyr Gly Phe Ala Glu Pro Tyr Asn Leu Tyr 

675 680 685 

AGC TOT CTG AAA GAA CTG GTG CTA CAT TAG CAA CAC ACC TCC CTT GTG 2112 
Ser Ser Leu Lys Glu Leu Val Leu His Tyr Gin His Thr Ser Leu Val 
10 690 695 700 

CAG CAC AAC GAC TCC CTC AAT GTC ACA CTA GCC TAC CCA GTA TAT GCA 216 0 

Gin His Asn Asp Ser Leu Asn Val Thr Leu Ala Tyr Pro Val Tyr Ala 

705 710 715 720 

15 

CAG CAG AGG CGA CAG GAT CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC 2208 

Gin Gin Arg Arg Gin Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly 

725 730 735 

20 GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC 2256 
Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu Asp Gly 

740 745 750 

GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT 2 3 04 

25 Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp 

755 760 765 

GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG 2 3 52 
Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys 
30 770 775 780 

CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG 2 4 00 

Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val 

785 790 795 800 

35 

CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC 2 44 8 

Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe 

805 810 815 

40 AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC 2 4 96 

Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe 

820 825 830 

AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC 2 54 4 
45 Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly 

835 840 845 

GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC GAP T' 

^n^^ '^V>' "^n-- 



55 



G65 



87 



Ar,n ^; e r His 
880 



AA''^ i xAi A 1 v„ M. i. l7 Vt_^ Hi^'^J L/\." - 
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Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn 

885 890 895 

TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC 273 6 
Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp 

900 905 910 

CAC TAG CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC 2 7 84 
His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro 
915 920 925 

GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC 2 83 2 
Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn 
020 935 940 

GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG 2 880 
Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly 
945 950 955 960 

ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TAA 2 913 

He Thr Leu Gly Met Asp Glu Leu Tyr Lys 

965 970 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Met Ser Ala Glu Gly Tyr Gin Tyr Arg Ala Leu Tyr Asp Tyr Lys Lys 

15 10 15 

Glu Arg Glu Glu Asp He Asp Leu His Leu Gly Asp He Leu Thr Val 

20 25 30 

Asn Lys Gly Ser Leu Val Ala Leu Gly Phe Ser Asp Gly Gin Glu Ala 

35 40 45 

Arg Pro Glu Glu He Gly Trp Leu Asn Gly Tyr Asn Glu Thr Thr Gly 

50 55 60 

Glu Arg Gly Asp Phe Pro Gly Thr Tyr Val Glu Tyr He Gly Arg Lys 
65 70 75 80 

Lys He Ser Pro Pro Thr Pro Lys Pro Ara Pro Pr^ A^n P^^ ^ ^^"^ 



Pro Pro Leu Leu He Lye Leu Val Glu Ala 

13 0 13 5 

Glu Cvs Ser Thr Leu T\'r Arc 'T^^.- ^ ^ 



125 

le Glu Lye LyB Gly Leu 
14 0 
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145 150 155 160 

Leu Arg Gin Leu Leu Asp Cys Asp Thr Pro Ser Val Asp Leu Glu Met 

165 170 175 

lie Asp Val His Val Leu Ala Asp Ala Phe Lys Arg Tyr Leu Leu Asp 
5 180 185 190 

Leu Pro Asn Pro Val lie Pro Ala Ala Val Tyr Ser Glu Met lie Ser 

195 200 205 

Leu Ala Pro Glu Val Gin Ser Ser Glu Glu Tyr lie Gin Leu Leu Lys 
210 215 220 

10 Lys Leu lie Arg Ser Pro Ser lie Pro His Gin Tyr Trp Leu Thr Leu 
225 230 235 240 

Gin Tyr Leu Leu Lys His Phe Phe Lys Leu Ser Gin Thr Ser Ser Lys 

245 250 255 

Asn Leu Leu Asn Aid Arg Val Leu Ser Glu lie Phe Ser Pro Met Leu 
15 260 265 270 

Phe Arg Phe Ser Ala Ala Ser Ser Asp Asn Thr Glu Asn Leu lie Lys 

275 280 285 

Val lie Glu lie Leu lie Ser Thr Glu Trp Asn Glu Arg Gin Pro Ala 
290 295 300 

20 Pro Ala Leu Pro Pro Lys Pro Pro Lys Pro Thr Thr Val Ala Asn Asn 
305 310 315 320 

Gly Met Asn Asn Asn Met Ser Leu Gin Asn Ala Glu Trp Tyr Trp Gly 

325 330 335 

Asp lie Ser Arg Glu Glu Val Asn Glu Lys Leu Arg Asp Thr Ala Asp 
25 340 345 350 

Gly Thr Phe Leu Val Arg Asp Ala Ser Thr Lys Met His Gly Asp Tyr 

355 360 365 

Thr Leu Thr Leu Arg Lys Gly Gly Asn Asn Lys Leu lie Lys lie Phe 
370 375 380 

30 His Arg Asp Gly Lys Tyr Gly Phe Ser Asp Pro Leu Thr Phe Ser Ser 
385 390 395 400 

Val Val Glu Leu lie Asn His Tyr Arg Asn Glu Ser Leu Ala Gin Tyr 

405 410 415 

Asn Pro Lys Leu Asp Val Lys Leu Leu Tyr Pro Val Ser Lys Tyr Gin 
35 420 425 430 

Gin Asp Gin Val Val Lys Glu Asp Asn lie Glu Ala Val Gly Lys Lys 

435 440 445 

Leu His Glu Tyr Asn Thr Gin Phe Gin Glu Lys Ser Arg Glu Tyr Asp 
450 455 460 

40 Arg Leu Tyr Glu Glu Tyr Thr Arg Thr Ser Gin Glu He Gin Met Lys 
465 470 475 4B0 

Arg Thr Ala He Glu Ala Phe Asn Glu Thr He Lys He Phe Glu Glu 

485 490 495 

Gin Cys Gin Thr Gin Glu Arg Tyr Ser Lys Glu Tyr He Glu Lys Phe 
45 500 505 510 

Lys Arg Glu Gly Asn Glu Lys Glu He Gin Arg He Met His Asn Tyr 

515 520 525 

Asp Lys Leu Lys Ser Arg He Ser Glu Tie T'.p Arr ' ^" ■ " 

c: -> ^ 



55 



y Asp Gin Tyr Leu KeL Trp Leu Thr Gin Lys Gly Val Arg Gin Lys 
580 585 590 

Lys Leu Asn Glu Trp Leu GVv' A^;n ^^ -i • : . ' 
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595 600 605 

Leu Val Glu Asp Asp Glu Asp Leu Pro His His Asp Glu Lys Thr Trp 

610 615 620 

Asn Val Gly Ser Ser Asn Arg Asn Lys Ala Glu Asn Leu Leu Arg Gly 
5 625 630 635 640 

Lys Arg Asp Gly Thr Phe Leu Val Arg Glu Ser Ser Lys Gin Gly Cys 

645 650 655 

Tyr Ala Cys Ser Val Val Val Asp Gly Glu Val Lys His Cys Val He 

660 665 670 

10 Asn Lys Thr Ala Thr Gly Tyr Gly Phe Ala Glu Pro Tyr Asn Leu Tyr 

675 680 685 

Ser Ser Leu Lys Glu Leu Val Leu His Tyr Gin His Thr Ser Leu Val 

690 695 700 

Gin His AFin Asp Scr Leu Asn val Thr Leu Ala Tyr Pro Val Tyr Ala 
15 705 710 715 720 

Gin Gin Arg Arg Gin Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly 

725 730 735 

Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu Leu Asp Gly 

740 745 750 

20 Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp 

755 760 765 

Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys 

770 775 780 

Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val 
25 785 790 795 800 

Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe 

805 810 815 

Lys Ser Ala Met Fro Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe 

820 825 830 

30 Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly 

835 840 845 

Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly He Asp Phe Lys Glu 

850 855 860 

Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His 
35 865 870 875 880 

Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn 

885 890 895 

Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp 

900 905 910 

40 His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro 

915 920 925 

Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn 

930 935 940 

Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly 
45 945 950 955 960 

He Thr Leu Gly Met Asp Glu Leu Tyr Lys 

965 970 



(C) STRAilDEDMESS : single 
55 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 
5 (B) LOCATION: 1...1785 

(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

10 ATG GGC AAC GCC GCC GCC GCC AAG AAG GGC AGC GAG CAG GAG AGC GTG 4 8 

Met Gly Asn Ala Ala Ala Ala Lys Lys Gly Ser Glu Gin Glu Ser Val 
15 10 15 

AAA GAG TTC CTA GCC AAA GCC AAG GAA GAT TTC CTG AAA AAA TGG GAA 9 6 

15 Lys Glu Phe Leu Ala Lys Ala Lys Glu Asp Phe Leu Lys Lys Trp Glu 

20 25 30 

GAC CCC TCT CAG AAT ACA GCC CAG TTG GAT CAG TTT GAT AGA ATC AAG 144 
Asp Pro Ser Gin Asn Thr Ala Gin Leu Asp Gin Phe Asp Arg lie Lys 
20 35 40 45 

ACC CTT GGC ACC GGC TCC TTT GGG CGA GTG ATG CTG GTG AAG CAC AAG 192 

Thr Leu Gly Thr Gly Ser Phe Gly Arg Val Met Leu Val Lys His Lys 

50 55 60 

25 

GAG AGT GGG AAC CAC TAG GCC ATG AAG ATC TTA GAC AAG CAG AAG GTG 24 0 

Glu Ser Gly Asn His Tyr Ala Met Lys lie Leu Asp Lys Gin Lys Val 
65 70 75 80 

30 GTG AAG CTA AAG CAG ATC GAG CAC ACT CTG AAT GAG AAG CGC ATC CTG 2 88 

Val Lys Leu Lys Gin lie Glu His Thr Leu Asn Glu Lys Arg lie Leu 

85 90 95 

CAG GCC GTC AAC TTC CCG TTC CTG GTC AAA CTT GAA TTC TCC TTC AAG 3 36 

35 Gin Ala Val Asn Phe Pro Phe Leu Val Lys Leu Glu Phe Ser Phe Lys 

100 105 110 

GAC AAC TCA AAC CTG TAG ATG GTC ATG GAG TAT GTA GCT GGT GGC GAG 3 84 

Asp Asn Ser Asn Leu Tyr Met Val Met Glu Tyr Val Ala Gly Gly Glu 
40 115 120 125 

ATG TTC TCC CAC CTA CGG CGG ATT GGA AGG TTC AGC GAG CCC CAT GCC 43 2 

Met Phe Ser His Leu Arg Arg lie Gly Arg Phe Ser Glu Pro His Ala 
130 135 140 



45 



CGT TTC TAC GCG GCG CAG ATC GTC CTG ACC TTT GAG TAT CTG CAC TCC 4 80 

Arg Phe Tyr Ala Ala Gin lie Val Leu Thr Phe Glu Tyr Leu His Ser 
145 150 155 



CAG CAG CGC TAT ATT CAG GTC ACA GAC TTC GGT TTT GCC AAG CGT GTG 57 6 

55 Gin Gin Gly Tyr lie Gin Val Thr Asp Phe Gly Phe Ala Lys Arg Val 

18 0 IB- : 
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AAA GGC CGT ACT TGG ACC TTG TOT GGG ACC CCT GAG TAG TTG GCC CCC 6 24 

Lys Gly Arg Thr Trp Thr Leu Cys Gly Thr Pro Glu Tyr Leu Ala Pro 
195 200 205 

5 

GAG ATT ATC CTG AGC AAA GGC TAG AAC AAG GCT GTG GAC TGG TGG GCT 672 
Glu lie lie Leu Ser Lys Gly Tyr Asn Lys Ala Val Asp Trp Trp Ala 
210 215 220 

10 CTC GGA GTC CTC ATC TAG GAG ATG GCT GCT GGT TAG CCA CCC TTC TTC 72 0 

Leu Gly Val Leu lie Tyr Glu Met Ala Ala Gly Tyr Pro Pro Phe Phe 
225 230 235 240 

GCT GAC CAG CCT ATC CAG ATU TAT GAG AAA ATC GTC TCT GGG AAG GTG 768 
15 Ala Asp Gin Pro lie Gin lie Tyr Glu Lys He Val Ser Gly Lys Val 

245 250 255 

CGG TTC CCA TCC CAC TTC AGC TCT GAC TTG AAG GAC CTG CTG CGG AAC 816 
Arg Phe Pro Ser His Phe Ser Ser Asp Leu Lys Asp Leu Leu Arg Asn 
20 260 265 270 

CTT CTG CAA GTG GAT CTA ACC AAG CGC TTT GGA AAC CTC AAG GAC GGG 8 64 

Leu Leu Gin Val Asp Leu Thr Lys Arg Phe Gly Asn Leu Lys Asp Gly 
275 280 285 

25 

GTC AAT GAC ATC AAG AAC CAC AAG TGG TTT GCC ACG ACT GAC TGG ATT 912 

Val Asn Asp He Lys Asn His Lys Trp Phe Ala Thr Thr Asp Trp He 
290 295 300 

30 GCC ATC TAT CAG AG A AAG GTG GAA GCT CCC TTC ATA CCA AAG TTT AAA 960 
Ala He Tyr Gin Arg Lys Val Glu Ala Pro Phe He Pro Lys Phe Lys 
305 310 315 320 

GGC CCT GGG GAC ACG AGT AAC TTT GAC GAC TAT GAG GAG GAA GAG ATC 10 0 8 

35 Gly Pro Gly Asp Thr Ser Asn Phe Asp Asp Tyr Glu Glu Glu Glu He 

325 330 335 

CGG GTC TCC ATC AAT GAG AAG TGT GGC AAG GAG TTT ACT GAG TTT GGG 10 5 6 

Arg Val Ser He Asn Glu Lys Cys Gly Lys Glu Phe Thr Glu Phe Gly 
40 340 345 350 

CGC GCC ATG AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA ATT 1104 
Arg Ala Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He 
355 360 365 



45 



CTT GTT GAA TTA GAT GGC GAT GTT AAT GGG CAA AAA TTC TCT GTT AGT 1152 
Leu Val Glu Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val Ser 

370 375 380 



ATT TGC ACT ACT GGG AAG CTA CCT GTT CCA TGG CCA ACG CTT GTC ACT 124 8 

55 :le CyG Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr 

4 C 5 A'^ 
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ACT CTC ACT TAT GGT GTT CAA TGC TTT TCT AGA TAG CCA GAT CAT ATG 12 96 

Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met 

420 425 430 

5 

AAA CAG CAT GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA CAG 13 44 

Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin 
435 440 445 

10 GAA AGA ACT ATA TTT TAC AAA GAT GAC GGG AAC TAC AAG ACA CGT GCT 13 92 

Glu Arg Thr lie Phe Tyr Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala 
450 455 460 

GAA GTC AAG TTT GAA G^T G.AT ACC CTT GTT AAT AGA ATC GAG TTA AAA 144 0 

15 Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys 
465 470 475 480 

GGT ATT GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA ATG GAA 14 8 8 

Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Met Glu 
20 485 490 495 

TAC AAT TAT AAC TCA CAT AAT GTA TAC ATC ATG GCA GAC AAA CCA AAG 153 6 

Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Pro Lys 

500 505 510 

25 

AAT GGC ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT AAA GAT GGA 15 84 

Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Lys Asp Gly 
515 520 525 

30 AGC GTT CAA TTA GCA GAC CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT 163 2 
Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp 
530 535 540 

GGC CCT GTC CTT TTA CCA GAC AAC CAT TAC CTG TCC ACG CAA TCT GCC 16 8 0 

35 Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala 
545 550 555 560 

CTT TCC AAA GAT CCC AAC GAA AAG AGA GAT CAC ATG ATC CTT CTT GAG 172 8 

Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met He Leu Leu Glu 
40 565 570 575 

TTT GTA ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC AAA 17 7 6 
Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 

580 585 590 



45 



CCT CAG GAG TAA 178 8 

Pro Gin Glu 
595 



;A} Lr^NGTH : 595 aTiino acids 
55 (B) TYPE: amine acid 

(C) STRAITOEDNESS : Single 
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(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Met Gly Asn Ala Ala Ala Ala Lys Lys Gly Ser Glu Gin Glu Ser Val 
15 10 15 

10 Lys Glu Phe Leu Ala Lys Ala Lys Glu Asp Phe Leu Lys Lys Trp Glu 

20 25 30 

Asp Pro Ser Gin Asn Thr Ala Gin Leu Asp Gin Phe Asp Arg lie Lys 

35 40 45 

Thr Leu Gly Thr r;ly Ser Phe Gly Axy Val Met Leu Val Lys His Lys 
15 50 55 60 

Glu Ser Gly Asn His Tyr Ala Met Lys lie Leu Asp Lys Gin Lys Val 
65 70 75 80 

Val Lys Leu Lys Gin lie Glu His Thr Leu Asn Glu Lys Arg lie Leu 

85 90 95 

20 Gin Ala Val Asn Phe Pro Phe Leu Val Lys Leu Glu Phe Ser Phe Lys 

100 105 110 

Asp Asn Ser Asn Leu Tyr Met Val Met Glu Tyr Val Ala Gly Gly Glu 

115 120 125 

Met Phe Ser His Leu Arg Arg lie Gly Arg Phe Ser Glu Pro His Ala 
25 130 135 140 

Arg Phe Tyr Ala Ala Gin lie Val Leu Thr Phe Glu Tyr Leu His Ser 
145 150 155 160 

Leu Asp Leu lie T^^r Arg Asp Leu Lys Pro Glu Asn Leu Leu lie Asp 

165 170 175 

30 Gin Gin Gly Tyr lie Gin Val Thr Asp Phe Gly Phe Ala Lys Arg Val 

180 185 190 

Lys Gly Arg Thr Trp Thr Leu Cys Gly Thr Pro Glu Tyr Leu Ala Pro 

195 200 205 

Glu lie lie Leu Ser Lys Gly Tyr Asn Lys Ala Val Asp Trp Trp Ala 
35 210 215 220 

Leu Gly Val Leu lie Tyr Glu Met Ala Ala Gly Tyr Pro Pro Phe Phe 
225 230 235 240 

Ala Asp Gin Pro lie Gin lie Tyr Glu Lys lie Val Ser Gly Lys Val 

245 250 255 

40 Arg Phe Pro Ser His Phe Ser Ser Asp Leu Lys Asp Leu Leu Arg Asn 

260 265 270 

Leu Leu Gin Val Asp Leu Thr Lys Arg Phe Gly Asn Leu Lys Asp Gly 

275 280 285 

Val Asn Asp lie Lys Asn His Lys Trp Phe Ala Thr Thr Asp Trp lie 
45 290 295 300 

Ala lie Tyr Gin Arg Lys Val Glu Ala Pro Phe lie Pro Lys Phe Lys 
305 310 315 320 

Gly Pro Gly Asp Thr Ser Asn Phe Asp Asp Tyr Glu Glu Glu Glu lie 



55 



■ '.. . J D ,T 

L.(>\.j Val G]ii Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val Ser 

370 375 380 

Gly Glu Gly Glu Gly Asp Ala Thr Tvr Glv Lys Lev Thr ' r^-. - 
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385 390 395 400 

lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr 

405 410 415 

Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met 
5 420 425 430 

Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin 

435 440 445 

Glu Arg Thr lie Phe Tyr Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala 
450 455 460 

10 Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys 
465 470 475 480 

Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Met Glu 

485 490 455 

Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Pro Lys 
15 500 505 510 

Asn Gly lie Lys Val Asn Phe Lys He Arg His Asn He Lys Asp Gly 

515 520 525 

Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp 
530 535 540 

20 Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala 
545 550 555 560 

Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met He Leu Leu Glu 

565 570 575 

Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
25 580 585 590 

Pro Gin Glu 
595 



30 



45 



(2) INFORMATION FOR SEQ ID NO: 70: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
35 {D} TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

40 (A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .2178 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

ATG AGC GAC GTG GCT ATT GTG AAG GAG GGT TGG CTG CAC AAA CGA GGG 4 8 

Met Ser Asp Val Ala He Val Lys Glu Gly Trp Leu His Lys Arg Gly 



GGC ACC TTC ATT GGC TAC AAG GAG CGG CCG CAG GAT GTG GAC CAA CGT 144 
55 Gly Thr Phe He Gly Tyr Lys Glu Arg Pro Gin Asp Val Asp Gin Arc 

1 ^ 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 



PCT/DK98/00145 



155 



GAG OCT CCC CTC AAC AAC TTC TCT GTG GCG CAG TGC CAG CTG ATG AAG 192 

Glu Ala Pro Leu Asn Asn Phe Ser Val Ala Gin Cys Gin Leu Met Lys 
50 55 60 

5 

ACG GAG CGG CCC CGG CCC AAC ACC TTC ATC ATC CGC TGC CTG CAG TGG 24 0 

Thr Glu Arg Pro Arg Pro Asn Thr Phe lie He Arg Cys Leu Gin Trp 

65 70 75 80 

10 ACC ACT GTC ATC GAA CGC ACC TTC CAT GTG GAG ACT CCT GAG GAG CGG 28 8 

Thr Thr Val He Glu Arg Thr Phe His Val Glu Thr Pro Glu Glu Arg 

85 90 95 

GAG GAG TGG ACA ACC GCn ATC CAG ACT GTG GOT GAC GGC CTC AAG AAG 33 6 

15 Glu Glu Trp Thr Thr Ala He Gin Thr Val Ala Asp Gly Leu Lys Lys 

100 105 110 

CAG GAG GAG GAG GAG ATG GAC TTC CGG TCG GGC TCA CCC AGT GAC AAC 3 84 

Gin Glu Glu Glu Glu Met Asp Phe Arg Ser Gly Ser Pro Ser Asp Asn 
20 115 120 125 

TCA GGG OCT GAA GAG ATG GAG GTG TCC CTG GCC AAG CCC AAG CAC CGC 4 32 

Ser Gly Ala Glu Glu Met Glu Val Ser Leu Ala Lys Pro Lys His Arg 
130 135 140 

25 

GTG ACC ATG AAC GAG TTT GAG TAC CTG AAG CTG CTG GGC AAG GGC ACT 4 80 

Val Thr Met Asn Glu Phe Glu Tyr Leu Lys Leu Leu Gly Lys Gly Thr 
145 150 155 160 

30 TTC GGC AAG GTG ATC CTG GTG AAG GAG AAG GCC ACA GGC CGC TAC TAC 52 8 

Phe Gly Lys Val He Leu Val Lys Glu Lys Ala Thr Gly Arg Tyr Tyr 

165 170 175 

GCC ATG AAG ATC CTC AAG AAG GAA GTC ATC GTG GCC AAG GAC GAG GTG 5 76 

35 Ala Met Lys He Leu Lys Lys Glu Val He Val Ala Lys Asp Glu Val 

180 185 190 

GCC CAC ACA CTC ACC GAG AAC CGC GTC CTG CAG AAC TCC AGG CAC CCC 624 
Ala His Thr Leu Thr Glu Asn Arg Val Leu Gin Asn Ser Arg His Pro 
40 195 200 205 

TTC CTC ACA GCC CTG AAG TAC TCT TTC CAG ACC CAC GAC CGC CTC TGC 672 
Phe Leu Thr Ala Leu Lys Tyr Ser Phe Gin Thr His Asp Arg Leu Cys 
210 215 220 



45 



TTT GTC ATG GAG TAC GCC AAC GGG GGC GAG CTG TTC TTC CAC CTG TCC 72 0 

Phe Val Met Glu Tyr Ala Asn Gly Gly Glu Leu Phe Phe His Leu Ser 
225 230 235 240 



ATT GTG TCA GGC CTG GAC TAC CTG CAC TGG GAG AAG AAC GTG GTG TAC 8 16 

55 He Val Ser Ala Leu Asp Tyr Leu His Ser Glu Lys Asn Val Val Tyr 

260 265 
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CGG GAC CTC AAG CTG GAG AAC CTC ATG CTG GAC AAG GAC GGG CAC ATT 8 64 

Arg Asp Leu Lys Leu Glu Asn Leu Met Leu Asp Lys Asp Gly His He 
275 280 285 

5 

AAG ATC ACA GAC TTC GGG CTG TGC AAG GAG GGG ATC AAG GAC GGT GCC 912 
Lys He Thr Asp Phe Gly Leu Cys Lys Glu Gly He Lys Asp Gly Ala 
290 295 300 

10 ACC ATG AAG ACC TTT TGC GGC ACA CCT GAG TAC CTG GCC CCC GAG GTG 960 
Thr Met Lys Thr Phe Cys Gly Thr Pro Glu Tyr Leu Ala Pro Glu Val 
305 310 315 320 

CTG GAG GAC A AT GAC TAC GGC CGT GUA GTG GAC TGG TGG GGG CTG GGC 10 08 

15 Leu Glu Asp Asn Asp Tyr Gly Arg Ala Val Asp Trp Trp Gly Leu Gly 

325 330 335 

GTG GTC ATG TAC GAG ATG ATG TGC GGT CGC CTG CCC TTC TAC AAC CAG 10 56 

Val Val Met Tyr Glu Met Met Cys Gly Arg Leu Pro Phe Tyr Asn Gin 
20 340 345 350 

GAC CAT GAG AAG CTT TTT GAG CTC ATC CTC ATG GAG GAG ATC CGC TTC 1104 

Asp His Glu Lys Leu Phe Glu Leu lie Leu Met Glu Glu He Arg Phe 

355 360 365 

25 

CCG CGC ACG CTT GGT CCC GAG GCC AAG TCC TTG CTT TCA GGG CTG CTC 1152 

Pro Arg Thr Leu Gly Pro Glu Ala Lys Ser Leu Leu Ser Gly Leu Leu 

370 375 380 

30 AAG AAG GAC CCC AAG CAG AGG CTT GGC GGG GGC TCC GAG GAC GCC AAG 12 00 

Lys Lys Asp Pro Lys Gin Arg Leu Gly Gly Gly Ser Glu Asp Ala Lys 
385 390 395 400 

GAG ATC ATG CAG CAT CGC TTC TTT GCC GGT ATC GTG TGG CAG CAC GTG 12 4 8 

35 Glu He Met Gin His Arg Phe Phe Ala Gly He Val Trp Gin His Val 

405 410 415 

TAC GAG AAG AAG CTC AGC CCA CCC TTC AAG CCC CAG GTC ACG TCG GAG 12 96 

Tyr Glu Lys Lys Leu Ser Pro Pro Phe Lys Pro Gin Val Thr Ser Glu 
40 420 425 430 

ACT GAC ACC AGG TAT TTT GAT GAG GAG TTC ACG GCC CAG ATG ATC ACC 13 44 

Thr Asp Thr Arg Tyr Phe Asp Glu Glu Phe Thr Ala Gin Met He Thr 
435 440 445 

45 

ATC ACA CCA CCT GAC CAA GAT GAC AGC ATG GAG TGT GTG GAC AGC GAG 13 92 

He Thr Pro Pro Asp Gin Asp Asp Ser Met Glu Cys Val Asp Ser Glu 
450 455 460 



TCG GAT CCA CCG GTC GCC i\CC ATG GTG AGC 7iAG GGC GAG GAG CTG TTC 14Bfi 
55 Ser Asp Pre Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe 

4 85 '.9^' '^^ 
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ACC GGG GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC 153 6 

Thr Gly Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly 

500 505 510 

5 

CAC AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAG GGC 15 84 

His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly 
515 520 525 

10 AAG CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC 163 2 

Lys Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro 
530 535 540 

THG CCC ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC 16 8 0 

15 Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser 
545 550 555 560 

CGC TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG 172 8 
Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met 
20 565 570 575 

CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC 177 6 

Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly 

580 585 590 

25 

AAC TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG 1824 

Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val 

535 600 605 

30 AAC CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC 1872 
Asn Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie 
610 615 620 

CTG GGG CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC 192 0 

35 Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie 
625 630 635 640 

ATG GCC GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC 196 8 

Met Ala Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg 
40 645 650 655 

CAC AAC ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG 2 016 

His Asn lie Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin 

660 665 670 



45 



AAC ACC CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC 2 06 4 

Asn Thr Pro lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr 
675 680 68^ 



CAC ATG GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC 2 160 

55 His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly 
705 71 n 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 



PCT/DK98/00145 



158 



5 



ATG GAC GAG CTG TAG AAG TAA 2181 
Met Asp Glu Leu Tyr Lys 

725 



(2) INFORMATION FOR SEQ ID NO: 71: 



(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 72 6 amino acids 

(B) TYPE: amino acid 
(C} STRANDEDNESS : single 
(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO. 71: 

20 Met Ser Asp Val Ala lie Val Lys Glu Gly Trp Leu His Lys Arg Gly 
15 10 15 

Glu Tyr lie Lys Thr Trp Arg Pro Arg Tyr Phe Leu Leu Lys Asn Asp 

20 25 30 

Gly Thr Phe lie Gly Tyr Lys Glu Arg Pro Gin Asp Val Asp Gin Arg 
25 35 40 45 

Glu Ala Pro Leu Asn Asn Phe Ser Val Ala Gin Cys Gin Leu Met Lys 

50 55 60 

Thr Glu Arg Pro Arg Pro Asn Thr Phe lie lie Arg Cys Leu Gin Trp 
65 70 75 BO 

30 Thr Thr Val He Glu Arg Thr Phe His Val Glu Thr Pro Glu Glu Arg 

85 90 95 

Glu Glu Trp Thr Thr Ala He Gin Thr Val Ala Asp Gly Leu Lys Lys 

100 105 110 

Gin Glu Glu Glu Glu Met Asp Phe Arg Ser Gly Ser Pro Ser Asp Asn 
35 115 120 125 

Ser Gly Ala Glu Glu Met Glu Val Ser Leu Ala Lys Pro Lys His Arg 

130 135 140 

Val Thr Met Asn Glu Phe Glu Tyr Leu Lys Leu Leu Gly Lys Gly Thr 
145 150 155 160 

40 Phe Gly Lys Val He Leu Val Lys Glu Lys Ala Thr Gly Arg Tyr Tyr 

165 170 175 

Ala Met Lys He Leu Lys Lys Glu Val He Val Ala Lys Asp Glu Val 

180 185 190 

Ala His Thr Leu Thr Glu Asn Arg Val Leu Gin Asn Ser Arg His Pro 
45 195 200 205 

Phe Leu Thr Ala Leu Lys Tyr Ser Phe Gin Thr His Asp Arg Leu Cys 

210 215 220 

Phe Val Met Glu Tyr Ala Asn Gly Gly Glu Leu Phe Phe His Leu Ser 
225 ^ 



55 



'\rg Asp Leu Lys Leu Giu Asn Leu ^'et Leu Asp Lys Asp Gly Hie lie 

275 280 285 

Lys He Thr Asp Phe Gly Leu Cys Lys Glu GIv Tie Lvs ^-^^ ' - 
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10 



15 



20 



25 



30 



35 



40 



45 





290 










295 










300 










Thr 


Met 


Lys 


Thr 


Phe 


Cys 


Gly 


Thr 


Pro 


Glu 


Tyr 


Leu 


Ala 


Pro 


Glu 


Val 


305 










310 










315 










320 


Leu 


Glu 


Asp 


Asn 


Asp 


Tyr 


Gly 


Arg 


Ala 


Val 


Asp 


Trp 


Trp 


Gly 


Leu 


Gly 










325 










330 










335 




Val 


Val 


Met 


Tyr 


Glu 


Met 


Met 


Cys 


Gly 


Arg 


Leu 


Pro 


Phe 


Tyr 


Asn 


Gin 








340 










345 










350 






Asp 


His 


Glu 


Lys 


Leu 


Phe 


Glu 


Leu 


He 


Leu 


Met 


Glu 


Glu 


He 


Arg 


Phe 






355 










360 










365 








Pro 


Arg 


Thr 


Leu 


Gly 


Pro 


Glu 


Ala 


Lys 


Ser 


Leu 


Leu 


Ser 


Gly 


Leu 


Leu 




370 










375 










380 










Lys 


Lys 


Asp 


Pro 


Lys 


Gin 


Arg 


Leu 


Gly 


Gly Gly 


Ser 


Glu 


Asp 


Ala 


Lys 


385 










390 










395 










-r w <^ 


Glii 


T 1 *^ 

^ -1. ^ 






His 


Arg 


Phe 


Phe 


Ala 


Gly 


He 


Val 


Trp 


Gin 


His 


Val 










405 










410 










415 




Tyr 


Glu 


Lys 


Lys 


Leu 


Ser 


Pro 


Pro 


Phe 


Lys 


Pro 


Gin 


Val 


Thr 


Ser 


Glu 








420 










425 










430 






Thr 


Asp 


Thr 


Arg 


Tyr 


Phe 


Asp 


Glu 


Glu 


Phe 


Thr 


Ala 


Gin 


Met 


He 


Thr 






435 










440 










445 








lie 


Thr 


Pro 


Pro 


Asp 


Gin 


Asp 


Asp 


Ser 


Met 


Glu 


Cys 


Val 


Asp 


Ser 


Glu 




450 










455 










460 










Arg 


Arg 


Pro 


His 


Phe 


Pro 


Gin 


Phe 


Ser 


Tyr 


Ser 


Ala 


Ser 


Ser 


Thr 


Ala 


465 










470 










475 










480 


Ser 


Asp 


Pro 


Pro 


Val 


Ala 


Thr 


Met 


Val 


Ser 


Lys 


Gly 


Glu 


Glu 


Leu 


Phe 










485 










490 










495 




Thr 


Gly 


Val 


Val 


Pro 


He 


Leu 


Val 


Glu 


Leu 


Asp 


Gly Asp 


Val 


Asn 


Gly 








500 










505 










c 1 n 

^ V 










Fhe 


Ser 


Val 


Ser 


Gly 


Glu 


Gly 


Glu 


Gly Asp 


Ala 


Thr 


Tyr 


Gly 






515 










520 










525 








LyB 


Leu 


Thr 


Leu 


Lys 


Phe 


He 


Cys 


Thr 


Thr 


Gly Lys 


Leu 


Pro 


Val 


Pro 




530 










535 










540 










Trp 


Pro 


Thr 


Leu 


Val 


Thr 


Thr 


Leu 


Thr 


Tyr 


Gly 


Val 


Gin 


Cys 


Phe 


Ser 


545 










550 










555 










560 


Arg 


Tyr 


Pro 


Asp 


His 


Met 


Lys 


Gin 


His 


Asp 


Phe 


Phe 


Lys 


Ser 


Ala 


Met 










565 










570 










575 




Pro 


Glu 


Gly 


Tyr 


Val 


Gin 


Glu 


Arg 


Thr 


He 


Phe 


Phe 


Lys 


Asp 


Asp 


Gly 








580 










585 










590 






Asn 


Tyr 


Lys 


Thr 


Arg 


Ala 


Glu 


Val 


Lys 


Phe 


Glu 


Gly Asp 


Thr 


Leu 


Val 






595 










600 










605 








Asn 


Arg 


He 


Glu 


Leu 


Lys 


Gly 


He 


Asp 


Phe 


Lys 


Glu 


Asp 


Gly 


Asn 


He 




610 










615 










620 










Leu 


Gly 


His 


Lys 


Leu 


Glu 


Tyr 


Asn 


Tyr 


Asn 


Ser 


His 


Asn 


Val 


Tyr 


He 


625 










630 










635 










640 


Met 


Ala 


Asp 


Lys 


Gin 


Lys 


Asn 


Gly 


He 


Lys 


Val 


Asn 


Phe 


Lys 


He 


Arg 










645 










650 










655 




His 


Asn 


He 


Glu 


Asp 


Gly 


Ser 


Val 


Gin 


Leu 


Ala 


Asp 


His 


Tyr 


Gin 


Gin 








660 










665 










670 






Asn 


Thr 


Pro 


He 


Gly 


Asp 


Gly 


Pro 


Val 






r V- 











55 



Met AsTJ 



lu Leu .yr Ly 
725 
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(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2751 base pairs 

5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
10 (ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2 748 
iu) UTHER INFORMATION: 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: 

ATG GCT GAC GTT TAC CCG GCC AAC GAC TCC ACG GCG TCT CAG GAC GTG 4 8 

Met Ala Asp Val Tyr Pro Ala Asn Asp Ser Thr Ala Ser Gin Asp Val 
20 1 5 10 15 

GCC AAC CGC TTC GCC CGC AAA GGG GCG CTG AGG CAG AAG AAC GTG CAT 96 
Ala Asn Arg Phe Ala Arg Lys Gly Ala Leu Arg Gin Lys Asn Val His 

20 25 30 

25 

GAG GTG AAA GAC CAC AAA TTC ATC GCC CGC TTC TTC AAG CAA CCC ACC 144 
Glu Val Lys Asp His Lys Phe He Ala Arg Phe Phe Lys Gin Pro Thr 
35 40 45 

30 TTC TGC AGC CAC TGC ACC GAC TTC ATC TGQ GGG TTT GGG AAA CAA GGC 192 
Phe Cys Ser His Cys Thr Asp Phe He Trp Gly Phe Gly Lys Gin Gly 
50 55 60 

TTC CAG TGC CAA GTT TGC TGT TTT GTG GTT CAT AAG AGG TGC CAT GAG 24 0 

35 Phe Gin Cys Gin Val Cys Cys Phe Val Val His Lys Arg Cys His Glu 
65 70 75 BO 

TTC GTT ACG TTC TCT TGT CCG GGT GCG GAT AAG GGA CCT GAC ACT GAC 28 8 

Phe Val Thr Phe Ser Cys Pro Gly Ala Asp Lys Gly Pro Asp Thr Asp 
40 85 90 95 

GAC CCC AGG AGC AAG CAC AAG TTC AAA ATC CAC ACA TAC GGA AGC CCT 33 6 

Asp Pro Arg Ser Lys His Lys Phe Lys He His Thr Tyr Gly Ser Pro 

100 105 110 

45 

ACC TTC TGT GAT CAC TGT GGG TCC CTG CTC TAT GGA CTT ATC CAC CAA 3 84 

Thr Phe Cys Asp His Cys Gly Ser Leu Leu Tyr Gly Leu He His Gin 

115 ir^o 



ATC AAT GAC CCT AGC CTC TGC GGA ATG GAT CAC ACA GAG AAG AGG GGG 4 8 

55 He Asn Asp Fro Ser Leu Cys Gly Met Asp His Thr Glu Lys Arc Glv 
145 : ^ ■ ^ ■■ 
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CGG ATT TAT CTG AAG OCT GAG GTC ACT GAT GAA AAG CTC CAC GTC ACG 52 8 

Arg He Tyr Leu Lys Ala Glu Val Thr Asp Glu Lys Leu His Val Thr 

165 170 175 

5 

GTA CGA GAT GCA AAA AAT CTA ATC CCT ATG GAT CCA AAT GGG CTT TCG 576 

Val Arg Asp Ala Lys Asn Leu He Pro Met Asp Pro Asn Gly Leu Ser 

180 185 190 

10 GAT CCT TAT GTG AAG CTG AAA CTA ATC CCT GAC CCC AAG AAT GAG AGC 62 4 

Asp Pro Tyr Val Lys Leu Lys Leu He Pro Asp Pro Lys Asn Glu Ser 
195 200 205 



15 



45 



AAA CAG AAA ACC AAA ACC ATC CGC TCC AAC CTG AAT CCT CAG TGG AAT 672 
Lys Gin Lys Thr Lys Thr He Arg Ser Asn Leu Asn Pro Gin Trp Asn 
210 215 220 



GAG TCC TTC ACG TTC AAA TTA AAA CCT TCA GAC AAA GAC CGG CGA CTG 72 0 

Glu Ser Phe Thr Phe Lys Leu Lys Pro Ser Asp Lys Asp Arg Arg Leu 

20 225 230 235 240 

TCT GTA GAA ATC TGG GAC TGG GAT CGG ACG ACT CGG AAT GAC TTC ATG 76 8 

Ser Val Glu He Trp Asp Trp Asp Arg Thr Thr Arg Asn Asp Phe Met 

245 250 255 

25 

GGA TCC CTT TCC TTT GGT GTC TCA GAG CTA ATG AAG ATG CCG GCC ACT 816 

Gly Ser Leu Ser Phe Gly Val Ser Glu Leu Met Lys Met Fro Ala Ser 

260 265 270 

30 GGA TGG TAT AAA GCT CAC AAC CAA GAA GAG GGC GAA TAT TAC AAC GTG 864 

Gly Trp Tyr Lys Ala His Asn Gin Glu Glu Gly Glu Tyr Tyr Asn Val 
275 280 285 

CCC ATT CCA GAA GGA GAT GAA GAA GGC AAC ATG GAA CTC AGG CAG AAG 912 

35 Pro He Pro Glu Gly Asp Glu Glu Gly Asn Met Glu Leu Arg Gin Lys 
290 295 300 

TTT GAG AAA GCC AAG CTA GGT CCT GTT GGT AAC AAA GTC ATC AGC CCT 96 0 

Phe Glu Lys Ala Lys Leu Gly Pro Val Gly Asn Lys Val He Ser Pro 

40 305 310 315 320 

TCA GAA GAC AGA AAG CAA CCA TCC AAC AAC CTG GAC AGA GTG AAA CTC 1008 

Ser Glu Asp Arg Lys Gin Pro Ser Asn Asn Leu Asp Arg Val Lys Leu 

325 330 335 



ACA GAC TTC AAC TTC CTC ATG GTG CTG GGG AAG GGG AGT TTT GGG AAG 10 56 

Thr Asp Phe Asn Phe Leu Met Val Leu Gly Lys Gly Ser Phe Gly Lys 

3 4 0 1 4 ■ ^ ' 



ATC CTG AAG AAG GAC GTG GTG ATC CAG GAC GAC GAC GTG GAG TGC ACC 115 
55 He Leu Lys Lys Asp Val Val He Gin Asp Asp Asp Val Glu Cys Thr 
3 7 0 - - - - r 
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ATG GTG GAG AAG CGC GTG CTG GCC CTG CTG GAG AAG CCG CCA TTT CTG 12 0 0 

Met Val Glu Lys Arg Val Leu Ala Leu Leu Asp Lys Pro Pro Phe Leu 
385 390 395 400 

5 

ACA CAG CTG CAC TCC TGC TTC CAG ACA GTG GAC CGG CTG TAC TTC GTC 12 4 8 
Thr Gin Leu His Ser Cys Phe Gin Thr Val Asp Arg Leu Tyr Phe Val 

405 410 415 

10 ATG GAA TAC GTC AAC GGC GGG GAT CTT ATG TAC CAC ATT CAG CAA GTC 12 96 

Met Glu Tyr Val Asn Gly Gly Asp Leu Met Tyr His lie Gin Gin Val 

420 425 430 

GGd AAA TTT AAG GAG CCA UAA GCA GTA TTC TAC GCA GCC GAG ATC TCC 13 44 

15 Gly Lys Phe Lys Glu Pro Gin Ala Val Phe Tyr Ala Ala Glu He Ser 

435 440 445 

ATC GGA CTG TTC TTC CTT CAT AAA AGA GGG ATC ATT TAC AGG GAT CTG 13 92 

He Gly Leu Phe Phe Leu His Lys Arg Gly He He Tyr Arg Asp Leu 
20 450 455 460 

AAG CTG AAC AAT GTC ATG CTG AAC TCA GAA GGG CAC ATC AAA ATC GCC 144 0 

Lys Leu Asn Asn Val Met Leu Asn Ser Glu Gly His He Lys He Ala 
465 470 475 480 

25 

GAC TTC GGG ATG TGC AAG GAA CAC ATG ATG GAT GGA GTC ACG ACC AGG 14 8 8 

Asp Phe Gly Met Cys Lys Glu His Met Met Asp Gly val Thr Thr Arg 

485 490 495 

30 ACC TTC TGC GGA ACT CCG GAC TAC ATT GCC CCA GAG ATA ATC GCT TAC 15 3 6 

Thr Phe Cys Gly Thr Pro Asp Tyr He Ala Pro Glu He He Ala Tyr 

500 505 510 

CAG CCG TAC GGG AAG TCT GTA GAT TGG TGG GCG TAC GGT GTG CTG CTG 15 84 

35 Gin Pro Tyr Gly Lys Ser Val Asp Trp Trp Ala Tyr Gly Val Leu Leu 

515 520 525 

TAC GAG ATG CTA GCC GGG CAG CCT CCG TTT GAT GGT GAA GAT GAA GAT 16 3 2 

Tyr Glu Met Leu Ala Gly Gin Pro Pro Phe Asp Gly Glu Asp Glu Asp 
40 530 535 540 

GAA CTG TTT CAG TCT ATA ATG GAG CAC AAC GTG TCC TAC CCC AAA TCC 168 0 

Glu Leu Phe Gin Ser He Met Glu His Asn Val Ser Tyr Pro Lys Ser 
545 550 555 560 



45 



TTG TCC AAG GAA GCC GTC TCC ATC TGC AAA GGA CTT ATG ACC AAA CAG 17 2 8 

Leu Ser Lys Glu Ala Val Ser He Cys Lys Gly Leu Met Thr Lys Gin 

565 570 ^"^^ 



GAG CAT GCC TTC TTC AGG AGG ATC GAC TGG GAG AAA CTG GAG AAC AGG 1B24 
55 Glu His Ala Phe Phe Arg Arg He Asp Trp Glu Lys Leu Glu Asn Arq 

5 9 5 ^ n ^ *^ " " 
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GAG ATC CAA CCA CCA TTC AAG CCC AAA GTG TGT GGC AAA GGA GCA GAA 18 72 

Glu lie Gin Pro Pro Phe Lys Pro Lys Val Cys Gly Lys Gly Ala Glu 
610 615 620 

5 

AAC TTT GAC AAG TTC TTC ACG CGA GGA CAG CCT GTC TTA ACA CCA CCA 192 0 

Asn Phe Asp Lya Phe Phe Thr Arg Gly Gin Pro Val Leu Thr Pro Pro 
625 630 635 640 

10 GAT CAG CTG GTC ATT OCT AAC ATA GAC CAA TCT GAT TTT GAA GGG TTC 196 8 

Asp Gin Leu Val lie Ala Asn lie Asp Gin Ser Asp Phe Glu Gly Phe 

645 650 655 

TCG TAT GTC AAC CCC CAU TTT GTG CAC CCA ATC TTG CAA AGT GCA GTA 2 016 

15 Ser Tyr Val Asn Pro Gin Phe Val His Pro He Leu Gin Ser Ala Val 

660 665 670 

GGG CGC GCC ATG AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA 2 064 

Gly Arg Ala Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro 
20 675 680 665 

ATT CTT GTT GAA TTA GAT GGC GAT GTT AAT GGG CAA AAA TTC TCT GTT 2112 

He Leu Val Glu Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val 
690 695 700 



25 



45 



AGT GGA GAG GGT GAA GGT GAT GCA ACA TAG GGA AAA CTT ACC CTT AAA 2160 
Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys 

710 715 720 



T n tr 



30 TTT ATT TGC ACT ACT GGG AAG CTA CCT GTT CCA TGG CCA ACG CTT GTC 22 08 

Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val 

725 730 735 

ACT ACT CTC ACT TAT GGT GTT CAA TGC TTT TCT AGA TAC CCA GAT CAT 225 6 

35 Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His 

740 745 750 

ATG AAA CAG CAT GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA 2 3 04 

Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val 
40 755 760 765 

CAG GAA AGA ACT ATA TTT TAC AAA GAT GAC GGG AAC TAC AAG ACA CGT 23 5 2 

Gin Glu Arg Thr He Phe Tyr Lys Asp Asp Gly Asn Tyr Lys Thr Arg 
770 775 780 



GCT GAA GTC AAG TTT GAA GGT GAT ACC CTT GTT AAT AGA ATC GAG TTA 2 4 00 

Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu 
785 790 "79=^ 



GAA TAC AAT TAT AAC TCA CAT /^T GTA TAC ATC ATG GCA GAC AAA CCA 24 96 

55 Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Pro 

8 2 0 - r - 
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AAG AAT GGC ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT AAA GAT 2 544 

Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Lys Asp 
835 840 845 

5 

GGA AGC GTT CAA TTA GCA GAG CAT TAT CAA CAA AAT ACT CCA ATT GGC 2 5 92 
Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly 
850 855 860 

10 GAT GGC CCT GTC CTT TTA CCA GAC AAC CAT TAC CTG TCC ACG CAA TCT 2 64 0 

Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser 
865 870 875 880 

GCC CTT TCC AAA GAT CCC AAC GAA AAG AGA GAT CAC ATG ATC CTT CTT 26 88 
15 Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met He Leu Leu 

865 890 895 

GAG TTT GTA ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC 27 3 6 
Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr 
20 900 905 910 

AAA CCT CAG GAG TAA 2 751 

Lys Pro Gin Glu 
915 



25 



40 



(2) INFORMATION FOR SEQ ID NO: 7*^ 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 916 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 



Met Ala Asp Val Tyr Pro Ala Asn Asp Ser Thr Ala Ser Gin Asp Val 

15 10 15 

Ala Asn Arg Phe Ala Arg Lys Gly Ala Leu Arg Gin Lys Asn Val His 

20 25 30 

Glu Val Lys Asp His Lys Phe He Ala Arg Phe Phe Lys Gin Pro Thr 
45 35 40 45 

Phe Cys Ser His Cys Thr Asp Phe He Trp Gly Phe Gly Lys Gin Gly 

50 55 60 

Phe Gin Cys Gin Val Cvs Cvh ph^ v-^ 



-CC 105 lie 

Thr Fhe Cys Asp His Cys Gly Ser Leu Leu Tyr Gly Leu He His Gin 
55 115 120 125 



Cj^"/ Met L'VF C.''/r^. A -r- '^--r- T.'-^ '/.-I* 
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130 135 140 

lie Asn Asp Pro Ser Leu Cys Gly Met Asp His Thr Glu Lys Arg Gly 
145 150 155 160 

Arg lie Tyr Leu Lys Ala Glu Val Thr Asp Glu Lys Leu His Val Thr 
5 165 170 175 

Val Arg Asp Ala Lys Asn Leu lie Pro Met Asp Pro Asn Gly Leu Ser 

180 185 190 

Asp Pro Tyr Val Lys Leu Lys Leu lie Pro Asp Pro Lys Asn Glu Ser 
195 200 205 

10 Lys Gin Lys Thr Lys Thr lie Arg Ser Asn Leu Asn Pro Gin Trp Asn 
210 215 220 

Glu Ser Phe Thr Phe Lys Leu Lys Pro Ser Asp Lys Asp Arg Arg Leu 
225 230 235 240 

Rer Val Glu lie Trp Asp Trp Asp Arg Thr Thr Arg Asn Asp Phe Met 
15 245 250 255 

Gly Ser Leu Ser Phe Gly Val Ser Glu Leu Met Lys Met Pro Ala Ser 

260 265 270 

Gly Trp Tyr Lys Ala His Asn Gin Glu Glu Gly Glu Tyr Tyr Asn Val 
275 280 285 

20 Pro lie Pro Glu Gly Asp Glu Glu Gly Asn Met Glu Leu Arg Gin Lys 
290 295 300 

Phe Glu Lys Ala Lys Leu Gly Pro Val Gly Asn Lys Val lie Ser Pro 
305 310 315 320 

Ser Glu Asp Arg Lys Gin Pro Ser Asn Asn Leu Asp Arg Val Lys Leu 
25 325 330 335 

Thr Asp Phe Asn Phe Leu Met Val Leu Gly Lys Gly Ser Phe Gly Lys 

340 345 350 

Val Met Leu Ala Asp Arg Lys Gly Thr Glu Glu Leu Tyr Ala lie Lys 
355 360 365 

30 lie Leu Lys Lys Asp Val Val lie Gin Asp Asp Asp Val Glu Cys Thr 
370 375 380 

Met Val Glu Lys Arg Val Leu Ala Leu Leu Asp Lys Pro Pro Phe Leu 
385 390 395 400 

Thr Gin Leu His Ser Cys Phe Gin Thr Val Asp Arg Leu Tyr Phe Val 
35 405 410 415 

Met Glu Tyr Val Asn Gly Gly Asp Leu Met Tyr His lie Gin Gin Val 

420 425 430 

Gly Lys Phe Lys Glu Pro Gin Ala Val Phe Tyr Ala Ala Glu He Ser 
435 440 445 

40 He Gly Leu Phe Phe Leu His Lys Arg Gly He He Tyr Arg Asp Leu 
450 455 460 

Lys Leu Asn Asn Val Met Leu Asn Ser Glu Gly His He Lys He Ala 
465 470 475 480 

Asp Phe Gly Met Cys Lys Glu His Met Met Asp Gly Val Thr Thr Arg 
45 485 490 495 

Thr Phe Cys Gly Thr Pro Asp Tyr He Ala Pro Glu He He Ala Tyr 

500 505 510 

Gin Pro Tyr Gly Lys Ser Val Asp Trp Trr A^. q '^-^ ^r^- --r - 



r.-:^ 55 5 56 0 

Leu SQL LvG Glu Ala Val Ser He Cys Lys Gly Leu Met Thr Lys Gin 
55 565 570 575 

Pro Ala Lvs Arq Leu GV/ ^-/p ^,}\' ^--^ -^-^ ■.■ ^ ■ ; 
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580 585 590 

Glu His Ala Phe Phe Arg Arg lie Asp Trp Glu Lys Leu Glu Asn Arg 

595 600 605 

Glu lie Gin Pro Pro Phe Lys Pro Lys Val Cys Gly Lys Gly Ala Glu 
5 610 615 620 

Asn Phe Asp Lys Phe Phe Thr Arg Gly Gin Pro Val Leu Thr Pro Pro 
625 630 635 640 

Asp Gin Leu Val lie Ala Asn lie Asp Gin Ser Asp Phe Glu Gly Phe 

645 650 655 

10 Ser Tyr Val Asn Pro Gin Phe Val His Pro lie Leu Gin Ser Ala Val 

S60 665 670 

Gly Arg Ala Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro 

675 680 €S5 

lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val 
15 690 695 700 

Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys 
705 710 715 720 

Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val 

725 730 735 

20 Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His 

740 745 750 

Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val 

755 760 765 

Gin Glu Arg Thr He Phe Tyr Lys Asp Asp Gly Asn Tyr Lys Thr Arg 
25 770 775 780 

Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu 
785 790 795 800 

Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Met 

805 810 815 

30 Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Pro 

820 825 830 

Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Lys Asp 

835 840 845 

Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly 
35 850 855 860 

Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser 
B65 870 875 880 

Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met He Leu Leu 

885 890 895 

40 Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr 

900 905 910 

Lys Pro Gin Glu 
915 

45 (2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2157 base pair? 



(IX) FEATURE: 

55 

(A) NA.ME/KKY: "^rd:^-: 
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1b 



25 



30 



35 



45 



167 

(B) LOCATION: 1. . ,2154 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

ATG TCG TCC ATC TTG CCA TTC ACG CCG CCA GTT GTG AAG AGA CTG CTG 4 8 

Met Ser Ser He Leu Pro Phe Thr Pro Pro Val Val Lys Arg Leu Leu 
IS 10 15 

GGA TGG AAG AAG TCA GCT GGT GGG TCT GGA GGA GCA GGC GGA GGA GAG 96 
Gly Trp Lys Lys Ser Ala Gly Gly Ser Gly Gly Ala Gly Gly Gly Glu 

20 25 30 

CAG AAT GGG CAG C-A.^ GAA AAG i'GG TGT GAG AAA GCA GTG AAA AGT CTG 14 4 

Gin Asn Gly Gin Glu Glu Lys Trp Cys Glu Lys Ala Val Lys Ser Leu 
35 40 45 



GTG AAG AAG CTA AAG AAA ACA GGA CGA TTA GAT GAG CTT GAG AAA GCC 192 
Val Lys Lys Leu Lys Lys Thr Gly Arg Leu Asp Glu Leu Glu Lys Ala 
20 50 55 60 

ATC ACC ACT CAA AAC TGT AAT ACT AAA TGT GTT ACC ATA CCA AGC ACT 24 0 

He Thr Thr Gin Asn Cys Asn Thr Lys Cys Val Thr He Pro Ser Thr 
65 70 75 80 



TGC TCT GAA ATT TGG GGA CTG AGT ACA CCA AAT ACG ATA GAT CAG TGG 288 

Cys Ser Glu He Trp Gly Leu Ser Thr Pro Asn Thr He Asp Gin Trp 

S5 90 95 

GAT ACA ACA GGC CTT TAC AGC TTC TCT GAA CAA ACC AGG TCT CTT GAT 336 

Asp Thr Thr Gly Leu Tyr Ser Phe Ser Glu Gin Thr Arg Ser Leu Asp 

100 105 110 

GGT CGT CTC CAG GTA TCC CAT CGA AAA GGA TTG CCA CAT GTT ATA TAT 3 84 

Gly Arg Leu Gin Val Ser His Arg Lys Gly Leu Pro His Val He Tyr 

120 125 



TGC CGA TTA TGG CGC TGG CCT GAT CTT CAC AGT CAT CAT GAA CTC AAG 43 2 

Cys Arg Leu Trp Arg Trp Pro Asp Leu His Ser His His Glu Leu Lys 
^0 130 135 140 

GCA ATT GAA AAC TGC GAA TAT GCT TTT AAT CTT AAA AAG GAT GAA GTA 480 

Ala He Glu Asn Cys Glu Tyr Ala Phe Asn Leu Lys Lys Asp Glu Val 
145 150 155 160 



TGT GTA AAC CCT TAC CAC TAT CAG AGA GTT GAG ACA CCA GTT TTG CCT 52 8 

Cys Val Asn Pro Tyr His Tyr Gin Arg Val Glu Thr Pro Val Leu Pro 

165 170 - 



GAT GAC TAT ACT CAC TCC ATT CCA G/\A AAC ACT AAC TTC CCA GCA 
55 Leu Asp Asp Tyr Thr His Ser He Pro Glu Asn Thr Asn Phe Pro Ala 

195 2C0 



624 
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GGA ATT GAG CCA CAG AGT AAT TAT ATT CCA GAA ACG CCA CCT CCT GGA 672 

Gly He Glu Pro Gin Ser Asn Tyr He Pro Glu Thr Pro Pro Pro Gly 
210 215 220 

5 

TAT ATC AGT GAA GAT GGA GAA ACA AGT GAC CAA CAG TTG AAT CAA AGT 72 0 

Tyr He Ser Glu Asp Gly Glu Thr Ser Asp Gin Gin Leu Asn Gin Ser 

225 230 235 240 

10 ATG GAC ACA GGC TCT CCA GCA GAA CTA TCT CCT ACT ACT CTT TCC CCT 76 8 

Met Asp Thr Gly Ser Pro Ala Glu Leu Ser Pro Thr Thr Leu Ser Pro 

245 250 255 



15 



25 



35 



45 



55 



GTT AAT CAT AGC TTU GAT TTA CAG CCA GTT ACT TAC TCA GAA CCT GCA 816 
Val Asn His Ser Leu Asp Leu Gin Pro Val Thr Tyr Ser Glu Pro Ala 

260 265 270 



TTT TGG TGT TCA ATA GCA TAT TAT GAA TTA AAT CAG AGG GTT GGA GAA 8 64 

Phe Trp Cys Ser He Ala Tyr Tyr Glu Leu Asn Gin Arg Val Gly Glu 
20 275 280 285 

ACC TTC CAT GCA TCA CAG CCC TCA CTC ACT GTA GAT GGC TTT ACA GAC 912 
Thr Phe His Ala Ser Gin Pro Ser Leu Thr Val Asp Gly Phe Thr Asp 
290 295 300 



CCA TCA AAT TCA GAG AGG TTC TGC TTA GGT TTA CTC TCC AAT GTT AAC 960 
Pro Ser Asn Ser Glu Arg Phe Cys Leu Gly Leu Leu Ser Asn Val Asn 

310 315 320 



T n n 
-* w J 



30 CGA AAT GCC ACG GTA GAA ATG ACA AGA AGG CAT ATA GGA AGA GGA GTG 10 0 8 

Arg Asn Ala Thr Val Glu Met Thr Arg Arg His He Gly Arg Gly Val 

325 330 335 



CGC TTA TAC TAC ATA GGT GGG GAA GTT TTT GCT GAG TGC CTA AGT GAT 105 6 

Arg Leu Tyr Tyr He Gly Gly Glu Val Phe Ala Glu Cys Leu Ser Asp 

340 345 350 



AGT GCA ATC TTT GTG CAG AGC CCC AAT TGT AAT CAG AGA TAT GGC TGG 1104 
Ser Ala He Phe Val Gin Ser Pro Asn Cys Asn Gin Arg Tyr Gly Trp 
40 355 360 365 

CAC CCT GCA ACA GTG TGT AAA ATT CCA CCA GGC TGT AAT CTG AAG ATC 115 2 

His Pro Ala Thr Val Cys Lys He Pro Pro Gly Cys Asn Leu Lys He 
370 375 380 



TTC AAC AAC CAG GAA TTT GCT GCT CTT CTG GCT CAG TCT GTT AAT CAG 12 0 0 

Phe Asn Asn Gin Glu Phe Ala Ala Leu Leu Ala Gin Ser Val Asn Gin 
385 390 I 



'1 i. 



AGT TTT GTG TU\A GGG TGG GGA GCA GAA TAC CGA AGG CAG ACG GTA AGA 12 9 6 

Ser Phe Val Lys Gly Trp Gly Ala Glu Tyr Arg Arg Gin Thr Val Thr 

4 2 0 ^ " ^ 
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AGT ACT CCT TGC TGG ATT GAA CTT CAT CTG AAT GGA CCT CTA GAG TGG 1344 

Ser Thr Pro Cys Trp lie Glu Leu His Leu Asn Gly Pro Leu Gin Trp 

435 440 445 

5 

TTG GAG AAA GTA TTA ACT CAG ATG GGA TCC CCT TCA GTG CGT TGC TCA 13 92 

Leu Asp Lys Val Leu Thr Gin Met Gly Ser Pro Ser Val Arg Cya Ser 

450 455 460 

10 AGC ATG TCA TGG GTA CCG CGG GCC CGG GAT CCA CCG GTC GCC ACC ATG 144 0 

Ser Met Ser Trp Val Pro Arg Ala Arg Asp Pro Pro Val Ala Thr Met 

465 470 475 480 

GTfJ AGC AJ^.G GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC 148 8 

15 Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 

485 490 495 

GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG 153 6 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 

20 500 505 510 

GGC GAG GGC GAT GCC ACC TAG GGC AAG CTG ACC CTG AAG TTC ATC TGC 1584 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 

515 520 525 

25 

ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG 163 2 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 

530 535 540 

30 ACC TAG GGC GTG CAG TGC TTC AGC CGC TAG CCC GAC CAC ATG AAG CAG 16 BO 

Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 

545 550 555 560 



35 



45 



CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAG GTC CAG GAG CGC 172 8 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 

565 570 575 



ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG 177 6 

Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 

40 580 585 

AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC 182 4 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie 

595 600 605 



GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC AAC 187 2 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
610 615 



ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG 196 8 

55 He Lys Val Asn Phe Lys He Arg His Asn He Glu Aep Gly Ser Val 

6 4 5 - 
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CAG CTC GCC GAG CAC TAG GAG GAG AAG AGG CGG ATG GGC GAG GGG CGC 2 016 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro 

660 665 670 

5 

GTG CTG CTG CCC GAG AAC CAC TAG CTG AGG ACC CAG TCC GCC CTG AGC 2 0 64 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
675 680 685 

10 AAA GAC GCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG 2112 
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
690 695 700 

ACC GCC GCC GGG ATC ACT CTC GGC ATG GAG GAG CTG TAG AAG TAA 2157 
15 Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
705 710 715 



(2) INFORMATION FOR SEQ ID NO: 75 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 

Met Ser Ser lie Leu Pro Phe Thr Pro Pro Val Val Lys Arg Leu Leu 

15 10 15 

Gly Trp Lys Lys Ser Ala Gly Gly Ser Gly Gly Ala Gly Gly Gly Glu 
35 20 25 30 

Gin Asn Gly Gin Glu Glu Lys Trp Cys Glu Lys Ala Val Lys Ser Leu 

35 40 45 

Val Lys Lys Leu Lys Lys Thr Gly Arg Leu Asp Glu Leu Glu Lys Ala 
50 55 60 

40 lie Thr Thr Gin Asn Cys Asn Thr Lys Cys Val Thr lie Pro Ser Thr 
65 70 75 80 

Cys Ser Glu lie Trp Gly Leu Ser Thr Pro Asn Thr lie Asp Gin Trp 

85 90 95 

Asp Thr Thr Gly Leu Tyr Ser Phe Ser Glu Gin Thr Arg Ser Leu Asp 
45 100 105 110 

Gly Arg Leu Gin Val Ser His Arg Lys Gly Leu Pro His Val lie Tyr 

115 120 125 

Cys Arg Leu Trp Arg Trp Pro Asp Leu His Ser Hip }V. ^^l'! ' 



ioL- 17u 

t'lo Val Leu Val Pro Arg His Thr Glu He Leu Thr Glu Leu Pro Pro 
55 180 185 190 

Leu Asp Asp Tyr Thr H:s Spr : ~^:'t' ■"]-.'■ ' 
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Gly lie Glu 

210 
Tyr lie Ser 
5 225 

Met Asp Thr 

Val Asn His 

10 Phe Trp Cys 

275 

Thr Phe His 

290 
Pro Ser ASii 
15 305 

Arg Asn Ala 

Arg Leu Tyr 

20 Ser Ala He 

355 

His Pro Ala 

370 
Phe Asn Asn 
25 385 

Gly Phe Glu 

Ser Phe Val 

30 Ser Thr Pro 

435 

Leu Asp Lys 

450 
Ser Met Ser 
35 465 

Val Ser Lys 

Glu Leu Asp 

40 Gly Glu Gly 

515 

Thr Thr Gly 
530 

Thr Tyr Gly 
45 545 

His Asp Phe 

Thr He Phe 



200 

Pro Gin Ser Asn Tyr 

215 

Glu Asp Gly Glu Thr 
230 

Gly Ser Pro Ala Glu 
245 

Ser Leu Asp Leu Gin 
260 

Ser He Ala Tyr Tyr 

280 

Ala Ser Gin Pro Ser 

295 

Ser Glu Arg Phe Cys 
310 

Thr Val Glu Met Thr 
325 

Tyr He Gly Gly Glu 
340 

Phe Val Gin Ser Pro 

360 

Thr Val Cys Lys He 

375 

Gin Glu Phe Ala Ala 
390 

Ala Val Tyr Gin Leu 
405 

Lys Gly Trp Gly Ala 
420 

Cys Trp He Glu Leu 

440 

Val Leu Thr Gin Met 

455 

Trp Val Pro Arg Ala 
470 

Gly Glu Glu Leu Phe 
485 

Gly Asp Val Asn Gly 
500 

Asp Ala Thr Tyr Gly 

520 

Lys Leu Pro Val Pro 

535 

Val Gin Cys Phe Ser 
550 

Phe Lys Ser Ala Met 
565 

Phe Lys Asp Asp Glv 



171 

He Pro Glu Thr 

220 

Ser Asp Gin Gin 
235 

Leu Ser Pro Thr 
250 

Pro Val Thr Tyr 
265 

Glu Leu Asn Gin 

Leu Thr Val Asp 

300 

Leu Gly Leu Leu 
315 

Arg Arg His He 
330 

Val Phe Ala Glu 
345 

Asn Cys Asn Gin 

Pro Pro Gly Cys 

380 

Leu Leu Ala Gin 
395 

Thr Arg Met Cys 
410 

Glu Tyr Arg Arg 
425 

His Leu Asn Gly 

Gly Ser Pro Ser 

460 

Arg Asp Pro Pro 
475 

Thr Gly Val Val 
490 

His Lys Phe Ser 
505 

Lys Leu Thr Leu 

Trp Pro Thr Leu 

540 

Arg Tyr Pro Asp 
555 

Pro Glu Gly Tyr 
570 

Asn Tvr T.v-^. '^^^ 



205 

Pro Pro Pro Gly 

Leu Asn Gin Ser 

240 

Thr Leu Ser Pro 
255 

Ser Glu Pro Ala 
270 

Arg Val Gly Glu 
285 

Gly Phe Thr Asp 

Ser Asn Val Asn 

320 

Gly Arg Gly Val 
335 

Cys Leu Ser Asp 
350 

Arg Tyr Gly Trp 
365 

Asn Leu Lys He 

Ser Val Asn Gin 

400 

Thr He Arg Met 

Gin Thr Val Thr 
430 

Pro Leu Gin Trp 
445 

Val Arg Cys Ser 

Val Ala Thr Met 

480 

Pro He Leu Val 
495 

Val Ser Gly Glu 
510 

Lys Phe He Cys 
525 

Val Thr Thr Leu 

His Met Lys Gin 

560 

Val Gin Glu Arg 
575 

A n Oil ' ' 



55 



■ . ^ eib 62 0 

Tyr Asn Ser llis Asr: Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 

625 630 635 64C 

He Lv5 Val Asn Phe T.'/r r-.-'- - ■ "* : : v.: 
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25 



172 

645 650 655 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro 

660 665 670 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 

675 6B0 685 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 

690 695 700 

Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
705 710 715 

(2) INFORMATION FOR SEQ ID NO: 76: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 57 base pairs 

15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
20 (ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2394 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 



ATQ GAC AAT ATG TCT ATT ACG AAT ACA CCA ACA AGT AAT GAT GCC TOT 4 8 

Met Asp Asn Met Ser lie Thr Asn Thr Pro Thr Ser Asn Asp Ala Cys 
30 1 5 10 15 

CTG AGC ATT GTG CAT AGT TTG ATG TGC CAT AGA CAA GGT GGA GAG AGT 96 
Leu Ser lie Val His Ser Leu Met Cys His Arg Gin Gly Gly Glu Ser 

20 25 30 

35 

GAA ACA TTT GCA AAA AGA GCA ATT GAA AGT TTG GTA AAG AAG CTG AAG 14 4 

Glu Thr Phe Ala Lys Arg Ala lie Glu Ser Leu Val Lys Lys Leu Lys 
35 40 45 

40 GAG AAA AAA GAT GAA TTG GAT TCT TTA ATA ACA GCT ATA ACT ACA AAT 192 
Glu Lys Lys Asp Glu Leu Asp Ser Leu lie Thr Ala lie Thr Thr Asn 
50 55 60 

GGA GCT CAT CCT AGT AAA TGT GTT ACC ATA CAG AGA ACA TTG GAT GGG 24 0 

45 Gly Ala His Pro Ser Lys Cys Val Thr lie Gin Arg Thr Leu Asp Gly 
65 70 75 80 



AGG CTT CAG GTG GCT GGT CGG AAA GGA TTT CCT ^AT ^'^^ A'^^ 



100 



eu HiB Lys Asn 
105 



Lu Leu Lys Hi 
110 



55 



AAA TAT TGT CAG TAT GC'l 
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Lys Tyr Cys Gin Tyr Ala Phe Asp Leu Lys Cys Asp Ser Val Cys Val 

120 

AAT CCA TAT CAC TAC GAA CGA GTT GTA TCA CCT GGA ATT GAT CTC TCA 
Asn Pro Tyr His Tyr Glu Arg Val Val Ser Pro Gly He Asp Leu Ser 

135 140 

GGA TTA ACA CTG GAG AGT AAT GCT CCA TCA AGT ATG ATG GTG AAG GAT 
Gly Leu Thr Leu Gin Ser Asn Ala Pro Ser Ser Met Met Val Lys Asp 

145 150 Tc-ii 

-L^u 155 160 

GAA TAT GTG CAT GAC TTT GAG GGA CAG CCA TCG TTG TCC ACT GAA GGA 
Glu Tyr Val His Asp Phe Glu Gly Gin Pro Ser Leu Spr Thr Glu Gly 

170 175 

CAT TCA ATT CAA ACC ATC CAG CAT CCA CCA AGT AAT CGT GCA TCG ACA 
His ser He Gin Thr He Gin His Pro Pro Ser Asn Arg Ala Ser Thr 

180 185 190 

GAG ACA TAC AGC ACC CCA GCT CTG TTA GCC CCA TCT GAG TCT AAT GCT 
Glu Thr Tyr Ser Thr Pro Ala Leu Leu Ala Pro Ser Glu Ser Asn Ala 
1^5 200 205 

ACC AGC ACT GCC AAC TTT CCC AAC ATT CCT GTG GCT TCC ACA AGT CAG 
Thr Ser Thr Ala Asn Phe Pro Asn He Pro Val Ala Ser Thr Ser Gin 
210 215 220 

CCT GCC AGT ATA CTG GGG GGC AGC CAT AGT GAA GGA CTG TTG CAG ATA 
Pro Ala Ser He Leu Gly Gly Ser His Ser Glu Gly Leu Leu Gin He 

230 235 240 

GCA TCA GGG CCT CAG CCA GGA CAG CAG CAG AAT GGA TTT ACT GGT CAG 
Ala Ser Gly Pro Gin Pro Gly Gin Gin Gin Asn Gly Phe Thr Gly Gin 

245 250 255 

CCA GCT ACT TAC CAT CAT AAC AGC ACT ACC ACC TGG ACT GGA AGT AGO 
Pro Ala Thr Tyr His His Asn Ser Thr Thr Thr Trp Thr Gly Ser Arg 

265 270 

ACT GCA CCA TAC ACA CCT AAT TTG CCT CAC CAC CAA AAC GGC CAT CTT 
Thr Ala Pro Tyr Thr Pro Asn Leu Pro His His Gin Asn Gly His Leu 

280 285 

CAG CAC CAC CCG CCT ATG CCG CCC CAT CCC GGA CAT TAC TGG CCT GTT 
Gin His His Pro Pro Met Pro Pro His Pro Gly His Tyr Trp Pro Val 
250 295 300 

CAC AAT GAG CTT GCA TTC CAG CCT CCC ATT TCr aA'^ ^^^-^ ^^-^ ^-^-^ 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



yr r::e G^u Met Asp Vai Gin Val Gly 
^25 



GAG ACA TTT AAG GTT CCT ^"A A^^^ ^^^^ 
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Glu Thr Phe Lys Val Pro Ser Ser Cys Pro He Val Thr Val Asp Gly 

340 345 350 

TAC GTG GAG CCT TCT GGA GGA GAT CGC TTT TGT TTG GGT CAA CTC TCC 1104 
5 Tyr Val Asp Pro Ser Gly Gly Asp Arg Phe Cys Leu Gly Gin Leu Ser 

355 360 365 

AAT GTC CAC AGG ACA GAA GCC ATT GAG AGA GCA AGG TTG CAC ATA GGC 1152 
Asn Val His Arg Thr Glu Ala lie Glu Arg Ala Arg Leu His He Gly 
370 375 380 

AAA GGT GTG GAG TTG GAA TGT AAA GGT GAA GGT GAT GTT TGG GTC AGG 12 0 0 

Lys Gly Val Gin Leu Glu Cys Lys Gly Gin Gly Asp Val Trp Val Arg 

390 395 400 



15 



20 



25 



35 



40 



45 



T O rr 
u ^ 



TGC CTT AGT GAC CAC GCG GTC TTT GTA CAG AGT TAC TAC TTA GAC AGA 124 6 

Cys Leu Ser Asp His Ala Val Phe Val Gin Ser Tyr Tyr Leu Asp Arg 

405 410 415 

GAA GCT GGG CGT GCA CCT GGA GAT GCT GTT CAT AAG ATC TAC CCA AGT 12 96 

Glu Ala Gly Arg Ala Pro Gly Asp Ala Val His Lys He Tyr Pro Ser 

420 425 430 

GCA TAT ATA AAG GTC TTT GAT TTG CGT CAG TGT CAT CGA CAG ATG CAG 13 44 

Ala Tyr He Lys Val Phe Asp Leu Arg Gin Cys His Arg Gin Met Gin 

435 440 445 



CAG CAU GCG GCT ACT GCA CAA GCT GCA GCA GCT GCC CAG GCA GCA GCC 13 92 

Gin Gin Ala Ala Thr Ala Gin Ala Ala Ala Ala Ala Gin Ala Ala Ala 
30 450 455 460 

GTG GCA GGA AAC ATC CCT GGC CCA GGA TCA GTA GGT GGA ATA GCT CCA 14 4 0 
Val Ala Gly Asn He Pro Gly Pro Gly Ser Val Gly Gly He Ala Pro 

470 475 



GCT ATC AGT CTG TCA GCT GCT GCT GGA ATT GGT GTT GAT GAC CTT CGT 14 8 8 

Ala He Ser Leu Ser Ala Ala Ala Gly He Gly Val Asp Asp Leu Arg 

485 490 

CGC TTA TGC ATA CTC AGG ATG AGT TTT GTG AAA GGC TGG GGA CCG GAT 153 6 

Arg Leu Cys He Leu Arg Met Ser Phe Val Lys Gly Trp Gly Pro Asp 

500 505 510 

TAC CCA AGA CAG AGC ATC AAA GAA ACA CCT TGC TGG ATT GAA ATT CAC 15 84 

Tyr Pro Arg Gin Ser He Lys Glu Thr Pro Cys Trp He Glu He His 
515 520 525 

TTA CAC CGG GCC CTC CAG CTC ^^A r^A^ — 



55 



545 



55 0 



Leu Asp Trp Asp Pre 

555 



•c Val Ala Thr Met 

560 



GTG AGC AAG GGC GA": r?.-. 
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Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 

565 570 

GAG CTG GAC GGC GAG GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG 1776 

5 Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 

580 

GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC 182 4 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
595 600 605 

ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG 1872 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr L£u 

610 bib 620 



15 



20 



25 



35 



40 



45 



ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG CAG 192 0 
Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 

630 635 640 

CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC 196 8 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 

645 650 655 

ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG 2 016 
Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 

660 665 670 



AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC 2064 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 

30 675 680 685 

GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC AAC 2112 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 

690 695 700 



TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC 216 0 

Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
705 710 715 720 

ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG 22 0 8 

He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 

725 730 735 

CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC 2 2 56 
Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 

740 745 750 

GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC Arn - - 



55 



IS Met 



770 



77 5 



'al Leu Leu Glu Phe Va 
7 8 0 



ACC GCC GCC GGG ATC ACT CTC ^^.^ 7.; 
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Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
785 790 795 



5 (2) INFORMATION FOR SEQ ID NO: 77 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



20 



30 



40 



(ii) MOLECULE TYPE: protein 
(v) FPJi.GMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 



Met Asp Asn Met Ser He Thr Asn Thr Pro Thr Ser Asn Asp Ala Cys 

15 10 15 

Leu Ser He Val His Ser Leu Met Cys His Arg Gin Gly Gly Glu Ser 

20 25 30 

Glu Thr Phe Ala Lys Arg Ala He Glu Ser Leu Val Lys Lys Leu Lys 

35 40 45 

Glu Lys Lys Asp Glu Leu Asp Ser Leu He Thr Ala He Thr Thr Asn 
25 50 55 60 

Gly Ala His Pro Ser Lys Cys Val Thr He Gin Arg Thr Leu Asp Gly 
^5 70 75 SO 

Arg Leu Gin Val Ala Gly Arg Lys Gly Phe Pro His Val He Tyr Ala 

85 90 95 

Arg Leu Trp Arg Trp Pro Asp Leu His Lys Asn Glu Leu Lys His Val 

100 105 110 

Lys Tyr Cys Gin Tyr Ala Phe Asp Leu Lys Cys Asp Ser Val Cys Val 

lis 120 125 

Asn Pro Tyr His Tyr Glu Arg Val Val Ser Pro Gly He Asp Leu Ser 
35 130 135 140 

Gly Leu Thr Leu Gin Ser Asn Ala Pro Ser Ser Met Met Val Lys Asp 

150 155 
Glu Tyr Val His Asp Phe Glu Gly Gin Pro Ser Leu Ser Thr Glu Gly 

165 170 175 

His Ser He Gin Thr He Gin His Pro Pro Ser Asn Arg Ala Ser Thr 

180 185 
Glu Thr Tyr Ser Thr Pro Ala Leu Leu Ala Pro Ser Glu Ser Asn Ala 

195 200 205 

Thr Ser Thr Ala Asn Phe Pro Asn He Pro Val Ala Ser Thr Ser Gin 
45 210 215 220 

Pro Ala Ser He Leu Gly Gly Ser His Ser Glu Gly Leu Leu Gin He 
225 230 235 240 

Ala Ser Gly Pro Gin Pro Gly Gin Gin Gin A--^ -^"^ 



-..i^n liis His Pro Pro Met Pro 
55 290 295 

His Asn Glu Leu Ala Phe Gin 



-BO 285 

Pro His Pro Gly His T>^r Trp Fro Val 

300 

Pre r r ' ■ " ■ 
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20 



177 

310 315 320 

Glu Tyr Trp Cys Ser He Ala Tyr Phe Glu Met Asp Val Gin Val Gly 

325 330 335 

Glu Thr Phe Lys Val Pro Ser Ser Cys Pro He Val Thr Val Asp Gly 

340 345 
Tyr Val Asp Pro Ser Gly Gly Asp Arg Phe Cys Leu Gly Gin Leu Ser 

355 360 365 

Asn Val His Arg Thr Glu Ala He Glu Arg Ala Arg Leu His He Gly 
370 375 3QQ 

Lys Gly Val Gin Leu Glu Cys Lys Gly Glu Gly Asp Val Trp Val Arg 

390 395 400 

Cys Leu Ser Asp His Ala Val Phe Val Gin Ser Tyr Tyr Leu Asp Arg 

405 410 
Glu Ala Gly Arg Ala Fro Gly Asp Ala Val His Lys He Tyr Pro Ser 

420 425 430 

Ala Tyr He Lys Val Phe Asp Leu Arg Gin Cys His Arg Gin Met Gin 

435 440 445 

Gin Gin Ala Ala Thr Ala Gin Ala Ala Ala Ala Ala Gin Ala Ala Ala 
450 455 

Val Ala Gly Asn He Pro Gly Pro Gly Ser Val Gly Gly He Ala Pro 
465 470 475 480 

Ala He Ser Leu Ser Ala Ala Ala Gly He Gly Val Asp Asp Leu Arg 

485 490 495 

Arg Leu Cys He Leu Arg Met Ser Phe Val Lys Gly Trp Gly Pro Asp 
25 500 505 510 

Tyr Pro Arg Gin Ser He Lys Glu Thr Pro Cys Trp He Glu He His 

515 520 525 

Leu His Arg Ala Leu Gin Leu Leu Asp Glu Val Leu His Thr Met Pro 

530 535 540 

He Ala Asp Pro Gin Pro Leu Asp Trp Asp Pro Pro Val Ala Thr Met 
545 550 555 560 

Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 

565 570 575 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
35 580 585 590 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 

595 600 605 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 

615 620 
Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
^25 630 635 640 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 

645 650 655 

Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 

660 665 670 

Lya Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 

675 680 685 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His T.vn " ' 
6^0 



30 



40 



3 0 



735 



;.n Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
55 740 750 

Val Leu Leu Pro Asd Asn Hvr^ T,*^'; ^'^^^ ^ ■ * ' . ■ 
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755 760 765 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 

770 775 780 

Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
5 785 790 795 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 313 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .3135 
20 (D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 



25 



35 



40 



45 



ATG GCG GGC TGG ATC GAG GCC CAG CAG CTG CAG GGA GAC GCG CTG CGC 4 8 

Met Ala Gly Trp He Gin Ala Gin Gin Leu Gin Gly Asp Ala Leu Arg 
is 10 15 



CAG ATG CAG GTG CTG TAC GGC CAG CAC TTC CCC ATC GAG GTC CGG CAC 96 

Gin Met Gin Val Leu Tyr Gly Gin His Phe Pro He Glu Val Arg His 

30 20 25 30 

TAC TTG GCC CAG TGG ATT GAG AGC CAG CCA TGG GAT GCC ATT GAC TTG 144 

Tyr Leu Ala Gin Trp He Glu Ser Gin Pro Trp Asp Ala He Asp Leu 
35 40 45 



GAC AAT CCC CAG GAC AGA GCC CAA GCC ACC CAG CTC CTG GAG GGC CTG 192 

Asp Asn Pro Gin Asp Arg Ala Gin Ala Thr Gin Leu Leu Glu Gly Leu 

50 55 eo 

GTG CAG GAG CTG CAG AAG AAG GCG GAG CAC CAG GTG GGG GAA GAT GGG 24 0 

Val Gin Glu Leu Gin Lys Lys Ala Glu His Gin Val Gly Glu Asp Gly 

65 70 75 80 

TTT TTA CTG AAG ATC AAG CTG GGG CAC TAC GCC ACG CAG CTC CAG AAA 28 8 

Phe Leu Leu Lys He Lys Leu Gly His Tyr Ala Thr Gin Leu Gin Lys 

85 90 95 

ACA TAT GAC CGC TGC CCC CTG GAG CTG GTC CGC r^^^ "^^'"^ ^^7/^ 



115 



Arg G^^u A^a Asn Asn Cys Ser Ser 



120 



55 



125 



CCG GCT GGG ATC CTG GTT GAC G^r r/ 
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Pro Ala Gly He Leu Val Asp Ala Met Ser Gin Lys His Leu Gin He 
130 135 140 

AAC CAG ACA TTT GAG GAG CTG CGA CTG GTC ACG CAG GAC ACA GAG AAT 4 80 

5 Asn Gin Thr Phe Glu Glu Leu Arg Leu Val Thr Gin Asp Thr Glu Asn 
145 150 155 160 

GAG CTG AAG AAA CTG CAG CAG ACT CAG GAG TAC TTC ATC ATC CAG TAG 52 8 

Glu Leu Lys Lys Leu Gin Gin Thr Gin Glu Tyr Phe He He Gin Tyr 
10 165 170 175 

CAG GAG AGC CTG AGG ATC CAA GCT CAG TTT GCC CAG CTG GCC CAG CTG 5 76 

Gin Glu Ser Leu Arg He Gin Ala Gin Phe Ala nin Leu Ala Glu Leu 

180 185 190 

15 

AGC CCC CAG GAG CGT CTG AGC CGG GAG ACG GCC CTC CAG CAG AAG CAG 624 
Ser Pro Gin Glu Arg Leu Ser Arg Glu Thr Ala Leu Gin Gin Lys Gin 
195 200 205 

20 GTG TCT CTG GAG GCC TGG TTG CAG CGT GAG GCA CAG ACA CTG CAG CAG 6 72 

Val Ser Leu Glu Ala Trp Leu Gin Arg Glu Ala Gin Thr Leu Gin Gin 
210 215 220 

TAC CGC GTG GAG CTG GCC GAG AAG CAC CAG AAG ACC CTG CAG CTG CTG 72 0 

25 Tyr Arg Val Glu Leu Ala Glu Lys His Gin Lys Thr Leu Gin Leu Leu 
225 230 235 240 

CGG AAG CAG CAG ACC ATC ATC CTG GAT GAC GAG CTG ATC CAG TGG AAG 7 68 

Arg Lys Gin Gin Thr He He Leu Asp Asp Glu Leu He Gin Trp Lys 
30 245 250 255 

CGG CGG CAG CAG CTG GCC GGG AAC GGC GGG CCC CCC GAG GGC AGC CTG 816 
Arg Arg Gin Gin Leu Ala Gly Asn Gly Gly Pro Pro Glu Gly Ser Leu 

260 265 270 

35 

GAC GTG CTA CAG TCC TGG TGT GAG AAG TTG GCC GAG ATC ATC TGG CAG 6 64 

Asp Val Leu Gin Ser Trp Cys Glu Lys Leu Ala Glu He He Trp Gin 
275 280 285 

40 AAC CGG CAG CAG ATC CGC AGG GCT GAG CAC CTC TGC CAG CAG CTG CCC 912 
Asn Arg Gin Gin He Arg Arg Ala Glu His Leu Cys Gin Gin Leu Pro 
290 295 300 

ATC CCC GGC CCA GTG GAG GAG ATG CTG GCC GAG GTC AAC GCC ACC ATC 96 0 

45 He Pro Gly Pro Val Glu Glu Met Leu Ala Glu Val Asn Ala Thr He 
305 310 315 320 

ACG GAC ATT ATC TCA GCC CTG GTG A^^ ^-^■'^ " — ' ■ ' 



Thr 



G^r. Thr Lys Fhe Ala Ala Thr Val 



55 



340 



345 



350 



r"^ ^rT^/~t f^'T^r-t r~iry/^ /-^ f-^ r- r-" 
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Arg Leu Leu Val Gly Gly Lys Leu Asn Val His Met Asn Pro Pro Gin 
355 360 365 

GTG AAG GCC ACC ATC ATC AGT GAG CAG CAG GCC AAG TOT CTG CTT AAA 1152 
5 Val Lys Ala Thr He He Ser Glu Gin Gin Ala Lys Ser Leu Leu Lys 
370 ■ 375 380 

AAT GAG AAC ACC CGC AAC GAG TGC AGT GGT GAG ATC CTG AAC AAC TGC 12 0 0 

Asn Glu Asn Thr Arg Asn Glu Cys Ser Gly Glu He Leu Asn Asn Cys 
10 385 390 395 400 

TGC GTG ATG GAG TAC CAC CAA GCC ACG GGC ACC CTC AGT GCC CAC TTC 124 8 

Cys Val Met Glu Tyr His Gin Ala Thr Gly Thr Leu Ser Ala Kls Phe 

^05 410 415 



15 



20 



25 



35 



40 



45 



AGG AAC ATG TCA CTG AAG AGG ATC AAG CGT GCT GAC CGG CGG GGT GCA 12 96 

Arg Asn Met Ser Leu Lys Arg He Lys Arg Ala Asp Arg Arg Gly Ala 

420 425 430 

GAG TCC GTG ACA GAG GAG AAG TTC ACA GTC CTG TTT GAG TCT CAG TTC 1344 

Glu Ser Val Thr Glu Glu Lys Phe Thr Val Leu Phe Glu Ser Gin Phe 
435 440 445 

AGT GTT GGC AGC AAT GAG CTT GTG TTC CAG GTG AAG ACT CTG TCC CTA 13 92 

Ser Val Gly Ser Asn Glu Leu Val Phe Gin Val Lys Thr Leu Ser Leu 
450 455 



CCT GTG GTT GTC ATC GTC CAC GGC AGC CAG GAC CAC AAT GCC ACG GCT 144 0 
Pro Val Val Val He Val His Gly Ser Gin Asp His Asn Ala Thr Ala 
30 465 470 475 480 

ACT GTG CTG TGG GAC AAT GCC TTT GCT GAG CCG GGC AGG GTG CCA TTT 14 8 8 

Thr Val Leu Trp Asp Asn Ala Phe Ala Glu Pro Gly Arg Val Pro Phe 

485 490 495 



GCC GTG CCT GAC AAA GTG CTG TGG CCG CAG CTG TGT GAG GCG CTC AAC 15 3 6 

Ala Val Pro Asp Lys Val Leu Trp Pro Gin Leu Cys Glu Ala Leu Asn 

500 505 510 

ATG AAA TTC AAG GCC GAA GTG CAG AGC AAC CGG GGC CTG ACC AAG GAG 15 8 4 

Met Lys Phe Lys Ala Glu Val Gin Ser Asn Arg Gly Leu Thr Lys Glu 
515 520 525 

AAC CTC GTG TTC CTG GCG CAG AAA CTG TTC AAC AAC AGC AGC AGC CAC 16 3 2 

Asn Leu Val Phe Leu Ala Gin Lys Leu Phe Asn Asn Ser Ser Ser His 

530 535 54Q 

CTG GAG GAC TAC AGT GGC CTG TCC GTG -"^^ -^"^ ' ' 



55 



565 



■i:ir Phe Trp Gin Trp Fhe Asp Gly 
570 575 



GTG 7iTG GAG GTG TTG AAG A A" 
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Val Met Glu Val Leu Lys Lys His His Lys Pro His Trp Asn Asp Gly 

580 585 

GCC ATC CTA GGT TTT GTG AAT AAG CAA CAG GCC CAC GAG CTG CTC ATC 
5 Ala He Leu Gly Phe Val Asn Lys Gin Gin Ala His Asp Leu Leu He 

595 600 605 

AAC AAG CCC GAG GGG ACC TTC TTG TTG CGC TTT AGT GAG TCA GAA ATC 
Asn Lys Pro Asp Gly Thr Phe Leu Leu Arg Phe Ser Asp Ser Glu He 
^0 615 620 

GGG GGC ATC ACC ATC GCC TGG AAG TTT GAC TCC CCG GAA CGC AAC CTG 
Gly Gly He Thr He Ala Trp Lys Phe Asp Ser Pro Glu Arg Asn Leu 

635 640 

TGG AAC CTG AAA CCA TTC ACC ACG CGG GAT TTC TCC ATC AGG TCC CTG 
Trp Asn Leu Lys Pro Phe Thr Thr Arg Asp Phe Ser He Arg Ser Leu 

650 655 



15 



20 



25 



35 



40 



45 



1824 



1872 



1920 



1968 



GCT GAC CGG CTG GGG GAC CTG AGC TAT CTC ATC TAT GTG TTT CCT GAC 2 016 

Ala Asp Arg Leu Gly Asp Leu Ser Tyr Leu He Tyr Val Phe Pro Asp 

660 665 670 

CGC CCC AAG GAT GAG GTC TTC TCC AAG TAC TAG ACT CCT GTG CTG GCT 2 064 

Arg Pro Lys Asp Glu Val Phe Ser Lys Tyr Tyr Thr Pro Val Leu Ala 
675 680 685 



AAA GCT GTT GAT GGA TAT GTG AAA CCA CAG ATC AAG CAA GTG GTC CCT 2112 

Lys Ala Val Asp Gly Tyr Val Lys Pro Gin He Lys Gin Val Val Pro 

30 690 695 700 

GAG TTT GTG AAT GCA TCT GCA GAT GCT GGG GGC AGC AGC GCC ACG TAC 2160 

Glu Phe Val Asn Ala Ser Ala Asp Ala Gly Gly Ser Ser Ala Thr Tyr 
'7°^ 710 715 720 



ATG GAC CAG GCC CCC TCC CCA GCT GTG TGC CCC CAG GCT CCC TAT AAC 22 0 8 
Met Asp Gin Ala Pro Ser Pro Ala Val Cys Pro Gin Ala Pro Tyr Asn 

725 730 735 

ATG TAC CCA CAG AAC CCT GAC CAT GTA CTC GAT CAG GAT GGA GAA TTC 22 5 6 

Met Tyr Pro Gin Asn Pro Asp His Val Leu Asp Gin Asp Gly Glu Phe 

740 745 750 

GAC CTG GAT GAG ACC ATG GAT GTG GCC AGG CAC GTG GAG GAA CTC TTA 23 0 4 
Asp Leu Asp Glu Thr Met Asp Val Ala Arg His Val Glu Glu Leu Leu 
755 760 765 

CGC CGA CCA ATG GAC AGT CTT GAC TCC CGC CT"" -^"^ ^^^^ - ' 

Ara Ar'^ r>--- 



55 



7 8 5 



790 



, >-■ r> ■ J C: ^ -^r >- 



7 95 



'ai Pro Arg Ala Arg 

80C 



GAT CCA CCG GTC GCC ACC ATG G"^- A"^ 
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Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr 

805 BIO 815 

GGG GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAG GTA AAC GGC CAC 24 96 

5 Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 

820 825 830 

AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG 254 4 

Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 
10 835 840 845 

CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG 2 5 92 

Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pre Trp 

fi'^fi 855 860 

15 

CCC ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC 2 64 0 

Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg 
865 870 875 880 

20 TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC 2 68 8 

Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro 

885 890 895 



25 



45 



GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC 2 73 6 
Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn 

900 905 910 



TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC 2 7 84 

Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 
30 915 920 925 

CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG 2 832 

Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu 
930 935 940 

35 

GGG CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG 2 8 80 

Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met 
945 950 955 960 

40 GCC GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC 2 92 8 

Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His 

965 970 975 



AAC ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC 2 97 6 

Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn 

980 985 990 



ACC CCC ATC GGC GAC GGC CCC GTG CTG CTr: 



55 



-I c 

^ w ^ J 



1010 

ATG GTC CTG CTG GAr; TT'^ ZT^r A"^ 



Pre Asr. Glu Ly^ Arg Asp His 
1020 
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Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met 
1^25 1030 1035 1040 

GAC GAG CTG TAG AAG TAA 3 3_3g 

Asp Glu Leu Tyr Lys 

1045 



10 



(2) INFORMATION FOR SEQ ID NO: 79: 



30 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1045 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Met Ala Gly Trp He Gin Ala Gin Gin Leu Gin Gly Asp Ala Leu Arg 

^5 10 15 

Gin Met Gin Val Leu Tyr Gly Gin His Phe Pro He Glu Val Arg His 
25 20 25 30 

Tyr Leu Ala Gin Trp He Glu Ser Gin Pro Trp Asp Ala He Asp Leu 

35 40 45 

Asp Asn Pro Gin Asp Arg Ala Gin Ala Thr Gin Leu' Leu Glu Gly Leu 

50 55 60 

Val Gin Glu Leu Gin Lys Lys Ala Glu His Gin Val Gly Glu Asp Gly 
65 70 75 BO 

Phe Leu Leu Lys He Lys Leu Gly His Tyr Ala Thr Gin Leu Gin Lys 

85 90 95 

Thr Tyr Asp Arg Cys Pro Leu Glu Leu Val Arg Cys He Arg His He 
35 100 105 110 

Leu Tyr Asn Glu Gin Arg Leu Val Arg Glu Ala Asn Asn Cys Ser Ser 

120 125 
Pro Ala Gly He Leu Val Asp Ala Met Ser Gin Lys His Leu Gin He 
130 135 

Asn Gin Thr Phe Glu Glu Leu Arg Leu Val Thr Gin Asp Thr Glu Asn 

145 150 155 160 

Glu Leu Lys Lys Leu Gin Gin Thr Gin Glu Tyr Phe He He Gin Tyr 

165 170 175 

Gin Glu Ser Leu Arg He Gin Ala Gin Phe Ala Gin Leu Ala Gin Leu 

45 180 185 190 

Ser Pro Gin Glu Arg Leu Ser Arg Glu Thr Ala Leu Gin Gin Lys Gin 

195 200 205 

Val Ser Leu Glu Ala Trp Leu Gin Ara nV: -^-^ --v - . 



1: 



Arg Arg Gin Gin Leu Ala 
55 2 60 

Asp Val Leu G^.n Ser ^r-r 



250 255 
Gly Asr: Gly Gly Pro Fro Glu Gly Ser Leu 
265 270 
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Asn Arg Gin 

290 
lie Pro Gly 
5 305 

Thr Asp lie 

Gin Pro Pro 

10 Arg Leu Leu 

355 

Val Lys Ala 

370 
Asn rJlu Asn 
15 385 

Cys Val Met 

Arg Asn Met 

20 Glu Ser Val 

435 

Ser Val Gly 

450 
Pro Val Val 
25 465 

Thr Val Leu 

Ala Val Fro 

30 Met Lys Phe 

515 

Asn Leu Val 
530 

Leu Glu Asp 
35 545 

Glu Asn Leu 

Val Met Glu 

40 Ala lie Leu 

595 

Asn Lys Pro 
610 

Gly Gly lie 
45 625 

Trp Asn Leu 

Ala Asp Arg 



280 

Gin lie Arg Arg Ala 

295 

Pro Val Glu Glu Met 
310 

lie Ser Ala Leu Val 
325 

Gin Val Leu Lys Thr 
340 

Val Gly Gly Lys Leu 

360 

Thr lie lie Ser Glu 

375 

Thr Axg Asn Glu Cys 
390 

Glu Tyr His Gin Ala 
405 

Ser Leu Lys Arg He 
420 

Thr Glu Glu Lys Phe 

440 

Ser Asn Glu Leu Val 

455 

Val lie Val His Gly 
470 

Trp Asp Asn Ala Phe 
485 

Asp Lys Val Leu Trp 
500 

Lys Ala Glu Val Gin 

520 

Phe Leu Ala Gin Lys 

535 

Tyr Ser Gly Leu Ser 
550 

Pro Gly Trp Asn Tyr 
565 

Val Leu Lys Lys His 
580 

Gly Phe Val Asn Lys 

600 

Asp Gly Thr Phe Leu 

615 

Thr He Ala Trp Lys 
630 

Lys Pro Phe Thr Thr 
645 

Leu Gly Asp Leu Ser 



184 

Glu His Leu Cys 

300 

Leu Ala Glu Val 
315 

Thr Ser Thr Phe 
330 

Gin Thr Lys Phe 
345 

Asn Val His Met 

Gin Gin Ala Lys 

3 80 

Ser Gly Glu He 
395 

Thr Gly Thr Leu 
410 

Lys Arg Ala Asp 
425 

Thr Val Leu Phe 

Phe Gin Val Lys 

460 

Ser Gin Asp His 
475 

Ala Glu Pro Gly 
490 

Pro Gin Leu Cys 
505 

Ser Asn Arg Gly 

Leu Phe Asn Asn 

540 

Val Ser Trp Ser 
555 

Thr Phe Trp Gin 

570 

His Lys Pro His 
585 

Gin Gin Ala His 

Leu Arg Phe Ser 

620 

Phe Asp Ser Pro 
635 

Arg Asp Phe Ser 
650 
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285 

Gin Gin Leu Pro 

Asn Ala Thr He 

320 

He He Glu Lys 
335 

Ala Ala Thr Val 
350 

Asn Pro Pro Gin 
365 

Ser Leu Leu Lys 

Leu Asn Asn Cys 

400 

Ser Ala His Phe 
415 

Arg Arg Gly Ala 
430 

Glu Ser Gin Phe 
445 

Thr Leu Ser Leu 

Asn Ala Thr Ala 

480 

Arg Val Pro Phe 
495 

Glu Ala Leu Asn 
510 

Leu Thr Lys Glu 
525 

Ser Ser Ser His 

Gin Phe Asn Arg 

560 

Trp Phe Asp Gly 
575 

Trp Asn Asp Gly 
590 

Asp Leu Leu He 
605 

Asp Ser Glu He 

Glu Arg Asn Leu 

640 

He Arg Ser Leu 
655 



55 



655 70C 

Pne Val Asn Ala Ser Ala Asp Ala Gly Gly Ser Ser Ala Thr Tyr 

710 

Met Asp Gin Ala Pro 5^er r: i -.r - - : - ^ ~ - ■ . 



7C5 
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725 735 
Met Tyr Pro Gin Asn Pro Asp His Val Leu Asp Gin Asp Gly Glu Phe 

740 745 750 

Asp Leu Asp Glu Thr Met Asp Val Ala Arg His Val Glu Glu Leu Leu 

755 760 765 

Arg Arg Pro Met Asp Ser Leu Asp Ser Arg Leu Ser Pro Pro Ala Gly 

770 775 780 

Leu Phe Thr Ser Ala Arg Gly Ser Leu Ser Trp Val Pro Arg Ala Arg 

790 795 800 

Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr 

805 810 815 

Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 

820 825 330 

LvR Phe Scr Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 

835 840 
Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 

850 855 860 

Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg 

870 875 880 

20 Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro 

885 890 895 

Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn 

900 905 910 

Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 
25 915 920 925 

Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu 

^30 935 540 

Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met 

550 955 
30 Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His 

965 970 975 

Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn 

980 985 990 

Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu 
^5 995 1000 1005 

Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 

1010 1015 ^020 

Met val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met 
025 1030 1035 io40 

Asp Glu Leu Tyr Lys 

1045 



40 



(2) INFORMATION FOR SEQ ID NO: 80 

^5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : sinale 



TGGGATCCTC AGGCCGTGCT GCTGGCCG no 

55 
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10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 81: 
GTCTCGAGGG AGCATGGGCA CCTTGCG 2 7 

(2) INFORMATION FOR SEQ ID NO: 82: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 



20 



25 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
TGGGATCCGA GAAGTCTATA TCCCATC 2 7 

(2) INFORMATION FOR SEQ ID NO: 83: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 8 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:83: 

TGGGATCCTT AGAAGTCTAT ATCCCATC 2 8 

(2) INFORMATION FOR SEQ ID NO:84- 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRTPTTO 



[i) SEQUENCE CHARACTERISTICS 
(A) LENGTH: 30 base pairs 
f TYPE : r>;:-l o ; ^ ■ 
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40 



45 



187 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
GTGAATTCTC GTCTGATTTC TGGCAGGAGG 3 0 

(2) INFORMATION FOR SEQ ID NO: 86: 



^ 'tLP\l^ Jj E I'J K K S o : G 1 c: _ e 

—J 

;n; TOFOLOGY: linear 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRAl^DSDN'ESS : single 
15 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
20 GTGAATTCTT TACGTCTGAT TTCTGGCAGG 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
GTCTCGAGCC ATGGACGAAC TGTTCCCCCT CATC 3 4 

35 (2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
GTGGATCCAA GGAGCTGATC TGACTCAGCA G 31 
(2) INFORMATION FOR SEQ ID KO-.P^- 



55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 
GTGGATCCTT AGGAGCTGAT CTGACTCAGC AG 
5 (2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 

15 

CCTCCTAAGC TTATCATGGA CCATTATGAT TC 

(2) INFORMATION FOR SEQ ID NO: 91: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 91 
CCTCCTGGAT CCCTGCGCAG GATGATGGTC CAG 

(2) INFORMATION FOR SEQ ID NO : 92 : 



30 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 45 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

GGATGGAAGC TTCAATGGCT GCCATCCGGA AGAAACTGGT GATTG 
(2) INFORMATION FOR SEQ ID NO: 93: 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 base pairs 

(B) TYPE: nucleic acid 



w 



55 GGATGGGGAT CCTCACAAGA CAAGGCAACC AGATTTTTTC TTCCC 



PCT/DK98/00145 



32 



32 



33 



45 
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(2) INFORMATION FOR SEQ ID NO: 94 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 94 
GGGAAGCTTC CATGAGCGAG ACGGTCATC 

(2) INFORMATION FOR SEQ ID NO: 95: 



29 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95 
25 CCCGGATCCT CAGGGAGAAC CCCGCTTC 

(2) INFORMATION FOR SEQ ID 



28 



1^ u : tD : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
GTGAATTCGA CCATGGAGCG GCCCCCGGGG 3 0 

(2) INFORMATION FOR SEQ ID NO: 97: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



... INFORmTIGN FOR SEQ IZ N0:9S 
55 (i) SEQUENCE CHARACTERISTICS: 
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10 



20 



30 



35 



45 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
GTGGTACCTC ATTCTGTTAA CCAACTCC 

(2} INFORMATION FOR SEQ ID NO: 99: 



28 



(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
GTCTCGAGAG ATGCTGTCCC GTGGGTGG 

(2) INFORMATION FOR SEQ ID NO: 10 0: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



28 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 0: 
GTGAATTCGC TTCCTCTTGA GGGAACC 2 7 

(2) INFORMATION FOR SEQ ID NO: 101: 



(i) SEQUENCE CHARACTERISTICS 
(A) LENGTH: 27 base pairs 
"^0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
GTGAATTCAC TTCCTCTTGA GGGAACC 2 7 

(2) TNFOPMA'^'^'"" 



(C, STRANDEDNESS: single 
55 (D) TOPCLCGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
GTCTCGAGCC ATGGAGAACT TCCAAAAGG no 

5 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 8 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

GTGGATCCCA GAGTCGAAGA TGGGGTAC 2 8 

(2) INFORMATION FOR SEQ ID NO: 104: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 

25 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

30 GTGGATCCTC AGAGTCGAAG ATGGGGTAC 2 9 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 3 0 base pairs 

CB) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 

40 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

GTGAATTCGG CGATGCCAGA CCCCGCGGCG 3 0 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 



55 



,xi; SEQUENCE DESCRIPTION; SEQ ID NO : 1 C 5 : 
GTGGATCCCA GGCACAGGCA GCCTCAGCCT 
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(2) INFORMATION FOR SEQ ID NO: 10 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 
Val Glu heu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
GTGGATCCTC AGGCACAGGC AGCCTCAGCC TTC 
15 (2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2616 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
( ix ) FEATURE : 

25 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1 . . .2613 
(D) OTHER INFORMATION: 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO; 108: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 



96 



40 GAG GGC GAG GGC GAT GCC ACC TAG GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 



TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CA^ AT^ r. ^ ^ 

T/H,. "-^Vt^ rT-,,^ " 



85 



90 



J. u L v "C^r' ''J a ^ G 1 n G 1 'l1 

q c; 
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CGC ACC ATC TTC 



AAG GAC GAC r.:.^ ^: 
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Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

5 Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAG 43 2 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
10 130 135 140 

AAC TAG AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin T.vs n^r. 

iSC 155 ' 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

GTG CAG CTC GCC GAC CAC TAG CAG CAG AAC ACC CCC ATC GGC GAC GGC 57 6 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAG CTG AGC ACC CAG TCC GCC CTG 6 24 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
1^5 200 205 



15 



20 



25 



AGC AAA c;AC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
30 210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAG AAG TCC 72 0 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 



35 



40 



45 



GGA CTC AGA TCT CGA GCT CAA GCT TCG AAT TCG GCG ATG CCA GAC CCC 76 8 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Ala Met Pro Asp Pro 

245 250 255 

GCG GCG CAC CTG CCC TTC TTC TAC GGC AGC ATC TCG CGT GCC GAG GCC 816 
Ala Ala His Leu Pro Phe Phe Tyr Gly Ser He Ser Arg Ala Glu Ala 

260 265 270 

GAG GAG CAC CTG AAG CTG GCG GGC ATG GCG GAC GGG CTC TTC CTG CTG 8 64 

Glu Glu His Leu Lys Leu Ala Gly Met Ala Asp Gly Leu Phe Leu Leu 
275 280 285 

CGC CAG TGC CTG CGC TCG CTG GGC GGC TAT GTG TT^ T'"^^ ^T^ ^ 



55 



3 0 



3 1 



3 15 



.Gu Asn Gi.y Thr 

320 



TAC GCC ATT GCC GGC GGC i^J^J, GCG 
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Tyr Ala He Ala Gly Gly Lys Ala His Cys Gly Pro Ala Glu Leu Cys 



325 330 



GAG TTC TAG TOG CGC GAG CCC GAG GGG CTG CCC TGC AAC CTG CGC AAG 
Glu Phe Tyr Ser Arg Asp Pro Asp Gly Leu Pro Cys Asn Leu Arg Lys 

350 



340 345 



CCG TGC AAC COG CCG TCG GGC CTC GAG CCG CAG CCG GGG GTC TTC GAC 
Pro Cys Asn Arg Pro Ser Gly Leu Glu Pro Gin Pro Gly Val Phe Asp 
355 360 

TGC CTG CGA GAC GCC ATG GTG CGT GAC TAC GTG CGC CAG ACG TGG AAG 
Cys Leu Arg Asp Ala Met Val Arg Asp Tyr Val Arn Gin Thr Tip Lys 

375 380 



CTG GAG GGC GAG GCC CTG GAG CAG GCC ATC ATC AGC CAG GCC CCG CAG 
Leu Glu Gly Glu Ala Leu Glu Gin Ala He He Ser Gin Ala Pro Gin 

395 400 

GTG GAG AAG CTC ATT GCT ACG ACG GCC CAC GAG CGG ATG CCC TGG TAC 
Val Glu Lys Leu He Ala Thr Thr Ala His Glu Arg Met Pro Trp Tyr 

■^"^ 410 415 

CAC AGC AGC CTG ACQ CGT GAG GAG GCC GAG CGC AAA CTT TAC TCT GGG 
His Ser Ser Leu Thr Arg Glu Glu Ala Glu Arg Lys Leu Tyr Ser Gly 

420 425 430 



ACA TAC GCC CTG TCC CTC ATC TAT GGG AAG ACG GTG TAC CAC TAC CTC 
Thr Tyr Ala Leu Ser Leu He Tyr Gly Lys Thr Val Tyr His Tyr Leu 
450 455 

ATC AGC CAA GAC AAG GCG GGC AAG TAC TGC ATT CCC GAG GGC ACC AAG 

He Ser Gin Asp Lys Ala Gly Lys Tyr Cys He Pro Glu Gly Thr Lys 
465 470 

^ 475 480 



TTT GAC ACG CTC TGG CAG CTG GTG GAG TAT CTG AAG CTG AAG GCG GAC 
Phe Asp Thr Leu Trp Gin Leu Val Glu Tyr Leu Lys Leu Lys Ala Asp 

485 490 

GGG CTC ATC TAC TGC CTG AAG GAG GCC TGC CCC AAC AGC AGT GCC AGC 
Gly Leu He Tyr Cys Leu Lys Glu Ala Cys Pro Asn Ser Ser Ala Ser 

500 505 510 

AAC GCC TCA GGG GCT GCT GCT CCC ACA r-'^ — - - 



1056 



1104 



1152 



1200 



1248 



1296 



GCG CAG ACC .AC GGC AAG TTC CTG CTG AGG CCG CGG AAG GAG CAG GGC 1344 
Ala Gin Thr Asp Gly Lys Phe Leu Leu Arg Pro Arg Lys Glu Gin Gly 
435 440 



1392 



1440 



1488 



1536 



A 



530 



35 



.eu Asn Ser Asp Gly Tyr 
540 



ACC CCT GAG CC 
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Thr Pro Glu Pro Ala Arg He Thr Ser Pro Asp Lys Pro Arg Pro Met 

550 555 560 

CCC ATG GAC ACQ AGC GTG TAT GAG AGO CCC TAG AGO GAG CCA GAG GAG 
Pro Met Asp Thr Ser Val Tyr Glu Ser Pro Tyr Ser Asp Pro Glu Glu 

565 570 

CTC AAG GAC AAG AAG CTC TTC CTG AAG CGC GAT AAC CTC CTC ATA GCT 
Leu Lys Asp Lys Lys Leu Phe Leu Lys Arg Asp Asn Leu Leu He Ala 

580 585 

GAC ATT GAA CTT GGC TGC GGC -C TTT GGC TCa GTG CGC CAG GGC GTG 
Asp He Glu Leu Gly Cys Gly Asn Phe Gly Ser Val Arg Gin Gly Val 
595 600 605 

TAC CGC ATG CGC AAG AAG CAG ATC GAC GTG GCC ATC AAG GTG CTG AAG 
Tyr Arg Met Arg Lys Lys Gin He Asp Val Ala He Lya Val Leu Lvs 

615 620 

CAG GGC ACG GAG AAG GCA GAC ACG GAA GAG ATG ATG CGC GAG GCG CAG 
Gin Gly Thr Glu Lys Ala Asp Thr Glu Glu Met Met Arg Glu Ala Gin 

625 630 £nt: 

635 640 

ATC ATG CAC CAG CTG GAC AAC CCC TAC ATC GTG CGG CTC ATT GGC GTC 
He Met His Gin Leu Asp Asn Pro Tyr He Val Arg Leu He Gly Val 

650 655 

TGC CAG GCC GAG GCC CTC ATG CTG GTC ATG GAG ATG GCT GGG GGC GGG 
Cys Gin Ala Glu Ala Leu Met Leu Val Met Glu Met Ala Gly Gly Gly 

660 665 670 



AAT GTG GCC GAG CTG CTG CAC CAG GTG TCC ATG GGG ATG AAG TAC CTG 

Asn Val Ala Glu Leu Leu His Gin Val Ser Met Gly Met Lys Tyr Leu 

695 700 

GAG GAG AAG AAC TTT GTG CAC CGT GAC CTG GCG GCC CGC AAC GTC CTG 

Glu Glu Lys Asn Phe Val His Arg Asp Leu Ala Ala Arg Asn Val Leu 

"^05 7in TTP- 

/J-U 715 720 

CTG GTT AAC CGG CAC TAC GCC AAG ATC AGC GAC TTT GGC CTC TCC AAA 

Leu Val Asn Arg His Tyr Ala Lys He Ser Asp Phe Gly Leu Ser Lys 

725 730 T^t:; 



.C3 Crz AAG TGG TAC GCA CCC GAA TGC ATC AAC TTC CGC AAG TTC 
Trp Pro Leu Lys Trp Tyr Ala Pro Glu Cys He Asn Phe Arg Lys Phe 
'^55 760 ^rr'^ 
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1728 



1776 



1824 



1872 



1920 



1968 



2016 



CCG CTG CAC AAG TTC CTG GTC GGC AAG AGG GAG GAG ATC CCT GTG AGC 2 064 
Pro Leu His Lys Phe Leu Val Gly Lys Arg Glu Glu He Pro Val Ser 
675 SBO 685 



2112 



2160 



2208 
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Ser Ser Arg Ser Asp Val 
770 

TTG TCC TAC GGC CAG AAG 
Leu Ser Tyr Gly Gin Lys 
785 790 

ATG GCC TTC ATC GAG CAG 
Met Ala Phe lie Glu Gin 

805 

CCA CCC GAA CTG TAC GCA 
Pro Pro Glu Leu Tyr Ala 

GAG GAT CGC CCC GAC TTC 
Glu Asp Arg Pro Asp Phe 
835 

TAC TAC AGC CTG GCC AGC 
Tyr Tyr Ser Leu Ala Ser 
850 

AAG GCT GAG GCT GCC TGT 
Lys Ala Glu Ala Ala Cys 
865 870 



196 

Trp Ser Tyr Gly Val Thr 

CCC TAC AAG AAG ATG AAA 
Pro Tyr Lys Lys Met Lys 

795 

GGC AAG CGG ATG GAG TGC 
Gly Lys Arg Met Glu Cys 

810 

CTC ATG AGT GAC TGC TGG 
Leu Met Ser Asp Cys Trp 
825 

CTG ACC GTG GAG CAG CGC 
Leu Thr Val Glu Gin Arg 
840 

AAG GTG GAA GGG CCC CCA 
Lys Val Glu Gly Pro Pro 
855 

GCC TGA 
Ala 



Met Trp Glu Ala 



GGG CCG GAG GTC 24 0 0 
Gly Pro Glu Val 

800 

CCA CCA GAG TGT 244 8 
Pro Pro Glu Cys 
815 

ATC TAC AAG TGG 2 4 96 
He Tyr Lys Trp 
830 

ATG CGA GCC TGT 2 544 

Met Arg Ala Cys 

845 

GGC AGC ACA CAG 2 592 
Gly Ser Thr Gin 



2616 



(2) INFORMATION FOR SEQ ID NO: 10 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 



Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

^5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trr P-^ -T"^- t r... - -^ 



9C 9 5 

^rg .nr ::.e Phe Phe Lys Asp Asp Gly Acn Tyr Lys Thr Arg Ala Gl 

100 105 110 

Val Lys Phe Glu Glv Apr t-v- t o-- , 



u 
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115 

He Asp Phe 
13 0 

Asn Tyr Asn 
5 145 

Gly He Lys 

Val Gin Leu 

10 Pro Val Leu 

195 

Ser Lys Asp 
210 

Val Thr Ala 
15 225 

Gly Leu Arg 

Ala Ala His 

20 Glu Glu His 

275 

Arg Gin Cys 

290 
Asp Val Arg 
25 305 

Tyr Ala He 

Glu Phe Tyr 

30 Pro Cys Asn 

355 

Cys Leu Arg 
370 

Leu Glu Gly 
35 385 

Val Glu Lys 

His Ser Ser 

40 Ala Gin Thr 

435 

Thr Tyr Ala 

450 
He Ser Gin 
45 465 

Phe Asp Thr 

Gly Leu He 



120 

Lys Glu Asp Gly Asn 

135 

Ser His Asn Val Tyr 
150 

Val Asn Phe Lys He 
165 

Ala Asp His Tyr Gin 
180 

Leu Pro Asp Asn His 

200 

Pro Asn Glu Lys Arg 

215 

Aid Gly lie Thr Leu 
230 

Ser Arg Ala Gin Ala 
245 

Leu Pro Phe Phe Tyr 
260 

Leu Lys Leu Ala Gly 

280 

Leu Arg Ser Leu Gly 

295 

Phe His His Phe Pro 
310 

Ala Gly Gly Lys Ala 
325 

Ser Arg Asp Pro Asp 
340 

Arg Pro Ser Gly Leu 

360 

Asp Ala Met Val Arg 

375 

Glu Ala Leu Glu Gin 
390 

Leu He Ala Thr Thr 
405 

Leu Thr Arg Glu Glu 
420 

Asp Gly Lys Phe Leu 

440 

Leu Ser Leu He Tyr 

455 

Asp Lys Ala Gly Lys 
470 

Leu Trp Gin Leu Val 
485 

Tyr Cys Leu Lys Glu 



197 

He Leu Gly His 

140 

He Met Ala Asp 
155 

Arg His Asn He 
170 

Gin Asn Thr Pro 
185 

Tyr Leu Ser Thr 

Asp His Met Val 

220 

Gly Met Asp Glu 
235 

Ser Asn Ser Ala 
250 

Gly Ser He Ser 
265 

Met Ala Asp Gly 

Gly Tyr Val Leu 

300 

He Glu Arg Gin 
315 

His Cys Gly Pro 
330 

Gly Leu Pro Cys 
345 

Glu Pro Gin Pro 

Asp Tyr Val Arg 

380 

Ala He He Ser 
395 

Ala His Glu Arg 
410 

Ala Glu Arg Lys 
425 

Leu Arg Pro Arg 

Gly Lys Thr Val 

460 

Tyr Cys He Pro 
475 

Glu Tyr Leu Lys 
490 

Ala Tvr P^^ r. 



125 

Lys Leu Glu Tyr 

Lys Gin Lys Asn 

160 

Glu Asp Gly Ser 
175 

He Gly Asp Gly 
190 

Gin Ser Ala Leu 
205 

Leu Leu Glu Phe 

Leu Tyr Lys Ser 

240 

Met Pro Asp Pro 
255 

Arg Ala Glu Ala 
270 

Leu Phe Leu Leu 
285 

Ser Leu Val His 

Leu Asn Gly Thr 

320 

Ala Glu Leu Cys 

O J D 

Asn Leu Arg Lys 
350 

Gly Val Phe Asp 
365 

Gin Thr Trp Lys 

Gin Ala Pro Gin 

400 

Met Pro Trp Tyr 
415 

Leu Tyr Ser Gly 
430 

Lys Glu Gin Gly 
445 

Tyr His Tyr Leu 

Glu Gly Thr Lys 

480 

Leu Lys Ala Asp 
495 



55 



Thr 
545 



"O 



Pro Met As:: 



Pre Ala Arg He 
550 

Thr Ser Va: 



hr 



Sc 



540 

Pro Asp Lys Pro Arg Pro Met 
555 560 
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565 570 575 

Leu Lys Asp Lys Lys Leu Phe Leu Lys Arg Asp Asn Leu Leu He Ala 

580 585 590 

Asp He Glu Leu Gly Cys Gly Asn Phe Gly Ser Val Arg Gin Gly Val 

595 600 605 

Tyr Arg Met Arg Lys Lys Gin He Asp Val Ala He Lys Val Leu Lys 

610 615 620 

Gin Gly Thr Glu Lys Ala Asp Thr Glu Glu Met Met Arg Glu Ala Gin 
625 630 635 640 

He Met His Gin Leu Asp Asn Pro Tyr He Val Arg Leu He Gly Val 

645 650 655 

Cys Gin Ala Glu Ala Leu Met Leu Val Met Glu Met Ala Gly Gly Gly 

660 665 670 

Pro Leu Hi? Lys Phe Leu Vdl Gly Lys Arg Glu Glu He Pro Val Ser 
15 675 680 685 

Asn Val Ala Glu Leu Leu His Gin Val Ser Met Gly Met Lys Tyr Leu 

690 695 700 

Glu Glu Lys Asn Phe Val His Arg Asp Leu Ala Ala Arg Asn Val Leu 
705 710 715 720 

Leu Val Asn Arg His Tyr Ala Lys He Ser Asp Phe Gly Leu Ser Lys 

725 730 735 

Ala Leu Gly Ala Asp Asp Ser Tyr Tyr Thr Ala Arg Ser Ala Gly Lys 

740 745 750 

Trp Pro Leu Lys Trp Tyr Ala Pro Glu Cys He Asn Phe Arg Lys Phe 
25 755 760 765 

Ser Ser Arg Ser Asp Val Trp Ser Tyr Gly Val Thr Met Trp Glu Ala 

770 775 780 

Leu Ser T>'r Gly Gin Lys Pro Tyr Lys Lys Met Lys Gly Pro Glu Val 
785 790 795 800 

Met Ala Phe He Glu Gin Gly Lys Arg Met Glu Cys Pro Pro Glu Cys 

805 810 815 

Pro Pro Glu Leu Tyr Ala Leu Met Ser Asp Cys Trp He Tyr Lys Trp 

820 825 830 

Glu Asp Arg Pro Asp Phe Leu Thr Val Glu Gin Arg Met Arg Ala Cys 
35 835 840 845 

Tyr Tyr Ser Leu Ala Ser Lys Val Glu Gly Pro Pro Gly Ser Thr Gin 

850 855 860 

Lys Ala Glu Ala Ala Cys Ala 
865 870 



20 



30 



40 



55 



(2) INFORMATION FOR SEQ ID NO: 110 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2598 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



— A ^ ^ V. ' , . . - . . ^ b 9 b 
;D) other INFORMATION: 

XI} SEQUENCE DESC? T PTTOr: : 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 PCT/DK98/00145 

199 

ATG CCA GAG CCC GCG GCG CAC CTG CCC TTC TTC TAG GGG AGC ATC TCG 4 8 

Met Pro Asp Pro Ala Ala His Leu Pro Phe Phe Tyr Gly Ser lie Ser 
15 10 15 

5 

CGT GCC GAG GCC GAG GAG CAC CTG AAG CTG GCG GGC ATG GCG GAC GGG 96 

Arg Ala Glu Ala Glu Glu His Leu Lys Leu Ala Gly Met Ala Asp Gly 

20 25 30 

10 CTC TTC CTG CTG GGC CAG TGC CTG CGC TCG CTG GGC GGC TAT GTG CTG 14 4 

Leu Phe Leu Leu Arg Gin Cys Leu Arg Ser Leu Gly Gly Tyr Val Leu 
35 40 45 



15 



45 



TCG CTC GTC CAC GAT GTG CGC TTC CAC CAC TTT CCC ATC GAG CGC CAG 192 
Ser Leu Val His Asp Val Arg Phe His His Phe Pro lie Glu Arg Gin 
50 55 60 



CTC AAC GGC AGC TAG GCG ATT GGC GGC GGC AAA GCG CAC TGT GGA CCG 24 0 

Leu Asn Gly Thr Tyr Ala He Ala Gly Gly Lys Ala His Gys Gly Pro 

20 65 70 75 80 

GCA GAG CTC TGC GAG TTC TAG TCG CGC GAC GGG GAC GGG CTG CCG TGC 28 8 

Ala Glu Leu Cys Glu Phe Tyr Ser Arg Asp Pro Asp Gly Leu Pro Cys 

85 90 95 

25 

AAC CTG CGC AAG CCG TGC AAC CGG CCG TCG GGG CTC GAG GGG CAG CCG 3 36 

Asn Leu Arg Lys Pro Cys Asn Arg Pro Ser Gly Leu Glu Pro Gin Pro 

100 105 110 

30 GGG GTG TTC GAC TGC CTG GGA GAC GGC ATG GTG CGT GAC TAG GTG CGC 3 84 

Gly Val Phe Asp Cys Leu Arg Asp Ala Met Val Arg Asp Tyr Val Arg 

115 120 125 

CAG ACG TGG AAG CTG GAG GGC GAG GGC CTG GAG CAG GCC ATC ATC AGC 4 32 

35 Gin Thr Trp Lys Leu Glu Gly Glu Ala Leu Glu Gin Ala He He Ser 

130 135 140 

CAG GCC CCG CAG GTG GAG AAG CTC ATT GCT ACG ACG GCC CAC GAG CGG 480 

Gin Ala Pro Gin Val Glu Lys Leu He Ala Thr Thr Ala His Glu Arg 

^0 145 150 155 160 

ATG CCC TGG TAC CAC AGC AGC CTG ACG CGT GAG GAG GCC GAG CGC AAA 52 8 

Met Pro Trp Tyr His Ser Ser Leu Thr Arg Glu Glu Ala Glu Arg Lys 

165 170 175 



CTT TAG TCT GGG GCG CAG ACC GAC GGC AAG TTC CTG CTG AGG CCG CGG 57 6 

Leu Tyr Ser Gly Ala Gin Thr Asp Gly Lys Phe Leu Leu Arg Pro Arg 

180 185 ^, 9^^ 



C^C TAC CTC ATC AGC CAA GAC AAG GCG GGC AAG TAC TGC ATT CCC 672 
55 Tyr His Tyr Leu He Ser Gin Asp Lys Ala Gly Lys Tyr Cys He Pro 
210 215 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 
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10 



15 



25 



30 



35 



45 



200 

GAG GGC ACC AAG TTT GAC ACG CTC TGG GAG CTG GTG GAG TAT CTG AAG 72 0 

Glu Gly Thr Lys Phe Asp Thr Leu Trp Gin Leu Val Glu Tyr Leu Lys 
225 230 235 240 

CTG AAG GCG GAC GGG CTC ATC TAC TGC CTG AAG GAG GCC TGC CCC AAC 768 

Leu Lys Ala Asp Gly Leu lie Tyr Cys Leu Lys Glu Ala Cys Pro Asn 

245 250 255 

AGC AGT GCC AGC AAC GCC TCA GGG GCT GCT GCT CCC ACA CTC CCA GCC 816 

Ser Ser Ala Ser Asn Ala Ser Gly Ala Ala Ala Pro Thr Leu Pro Ala 

260 265 270 

CAC CCA TCP ACG TTG ACT CAT OCT CAG AGA CGA ATC GAC ACC CTC AAC 8 64 

His Pro Ser Thr Leu Thr His Pro Gin Arg Arg He Asp Thr Leu Asn 
275 280 285 



TCA GAT GGA TAC ACC CCT GAG CCA GCA CGC ATA ACG TCC CCA GAC AAA 912 

Ser Asp Gly Tyr Thr Pro Glu Pro Ala Arg He Thr Ser Pro Aap Lys 
20 290 295 300 

CCG CGG CCG ATG CCC ATG GAC ACG AGC GTG TAT GAG AGC CCC TAC AGC 960 

Pro Arg Pro Met Pro Met Asp Thr Ser Val Tyr Glu Ser Pro Tyr Ser 

305 310 315 320 



GAC CCA GAG GAG CTC AAG GAC AAG AAG CTC TTC CTG AAG CGC GAT AAC 1008 

Asp Pro Glu Glu Leu Lys Asp Lye Lys Leu Phe Leu Lva Ara Asxi Asn 

330 335 

CTC CTC ATA GCT GAC ATT GAA CTT GGC TGC GGC AAC TTT GGC TCA GTG 105 6 

Leu Leu He Ala Asp He Glu Leu Gly Cys Gly Asn Phe Gly Ser Val 

340 345 350 

CGC CAG GGC GTG TAC CGC ATG CGC AAG AAG CAG ATC GAC GTG GCC ATC 1104 

Arg Gin Gly Val Tyr Arg Met Arg Lys Lys Gin He Asp Val Ala He 

355 360 365 



AAG GTG CTG AAG CAG GGC ACG GAG AAG GCA GAC ACG GAA GAG ATG ATG 115 2 

Lys Val Leu Lys Gin Gly Thr Glu Lys Ala Asp Thr Glu Glu Met Met 

^0 370 375 380 

CGC GAG GCG CAG ATC ATG CAC CAG CTG GAC AAC CCC TAC ATC GTG CGG 12 00 

Arg Glu Ala Gin He Met His Gin Leu Asp Asn Pro Tyr He Val Arg 

385 390 395 400 



CTC ATT GGC GTC TGC CAG GCC GAG GCC CTC ATG CTG GTC ATG GAG ATG 12 4 8 

Leu He Gly Val Cys Gin Ala Glu Ala Leu Met Leu Val Met Glu Met 

405 410 4^^ 



ATC r^TT GTG AGC AAT GTG GCC GAG CTG CTG CAC CAG GTG TCC ATG GGG 1344 
55 :le Pro Val Ser Asn Val Ala Glu Leu Leu His Gin Val Ser Met Gly 

4 3 5 44^ - 



SUBSTITUTE SHEET (HULE 26) 
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201 



ATG AAG TAG CTG GAG GAG AAG AAC TTT GTG CAC CGT GAG CTG GCG GCC 13 92 

Met Lys Tyr Leu Glu Glu Lys Asn Phe Val His Arg Asp Leu Ala Ala 
450 455 460 

5 

CGC AAC GTC CTG CTG GTT AAC CGG CAC TAC GCC AAG ATC AGC GAC TTT 144 0 
Arg Asn Val Leu Leu Val Asn Arg His Tyr Ala Lys He Ser Asp Phe 
465 470 475 460 

10 GGC CTC TCC AAA GCA CTG GGT GCC GAC GAC AGC TAC TAC ACT GCC CGC 14 8 8 

Gly Leu Ser Lys Ala Leu Gly Ala Asp Asp Ser Tyr Tyr Thr Ala Arg 

485 490 495 



15 



35 



45 



TCA GPA GGG AAG TGG CCG UTC AAG TGG TAC GCA CCC GAA TGC ATC AAC 153 6 

Ser Ala Gly Lys Trp Pro Leu Lys Trp Tyr Ala Pro Glu Cys He Asn 

500 505 510 



TTC CGC AAG TTC TCC AGC CGC AGC GAT GTC TGG AGC TAT GGG GTC ACC 15 84 

Phe Arg Lys Phe Ser Ser Arg Ser Asp Val Trp Ser Tyr Gly Val Thr 

20 515 520 525 

ATG TGG GAG GCC TTG TCC TAC GGC CAG AAG CCC TAC AAG AAG ATG AAA 163 2 

Met Trp Glu Ala Leu Ser Tyr Gly Gin Lys Pro Tyr Lys Lys Met Lys 
530 535 540 

25 

GGG CCG GAG GTC ATG GCC TTC ATC GAG CAG GGC AAG CGG ATG GAG TGC 168 0 

Gly Pro Glu Val Met Ala Phe He Glu Gin Gly Lys Arg Met Glu Cys 

^45 550 555 560 

30 CCA CCA GAG TGT CCA CCC GAA CTG TAC GCA CTC ATG AGT GAC TGC TGG 172 8 

Pro Pro Glu Cys Pro Pro Glu Leu Tyr Ala Leu Met Ser Asp Cys Trp 

565 570 575 



ATC TAC AAG TGG GAG GAT CGC CCC GAC TTC CTG ACC GTG GAG CAG CGC 177 6 

He Tyr Lys Trp Glu Asp Arg Pro Asp Phe Leu Thr Val Glu Gin Arg 

580 585 590 



ATG CGA GCC TGT TAC TAC AGC CTG GCC AGC AAG GTG GAA GGG CCC CCA 182 4 

Met Arg Ala Cys Tyr Tyr Ser Leu Ala Ser Lys Val Glu Gly Pro Pro 
40 595 600 605 

GGC AGC ACA CAG AAG GCT GAG GCT GCC TGT GCC TGG GAT CCA CCG GTC 187 2 

Gly Ser Thr Gin Lys Ala Glu Ala Ala Cys Ala Trp Asp Pro Pro Val 
610 615 620 



GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC 192 0 

Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro 
625 630 f-m r ^ 



55 



ri'C GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG 
Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys 



6 6 0 r r 
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wo 98/45704 



PCT/DK98/00145 



202 

TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG 2 0 64 

Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val 
675 680 685 

5 

ACC ACC CTG ACC TAG GGC GTG CAG TGC TTC AGC CGC TAG CCC GAC CAC 2112 
Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His 
690 695 700 

10 ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAG GTC 216 0 
Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val 
705 710 715 720 



15 



25 



30 



35 



45 



CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAG AAG ACC CGC 220 8 
Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg 

725 730 735 



GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG 225 6 

Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu 

20 740 745 750 

AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG 23 04 

Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu 

755 760 765 



GAG TAC AAC TAG AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG 2 352 
Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin 
770 775 780 

AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC 24 00 
Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp 
705 790 795 800 

GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC 244 8 

Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly 

805 810 815 



GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC 2 4 96 

Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser 

^0 820 825 830 

GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG 2 54 4 

Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu 
835 840 845 



GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC 2 5 92 

Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr 
850 855 Pf^n 



55 (2) INFORMATION FOR SEQ IH NO ; 1 1 1 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 
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10 



20 



30 



40 



203 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 865 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

[xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111 



Met Pro Asp Pro Ala Ala His Leu Pro Phe Phe Tyr Gly Ser lie Ser 

15 10 15 

Arg Ala Glu Ala Glu Glu His Leu Lys Leu Ala Gly Met Ala Asp Gly 
15 20 25 30 

Leu Phe Leu Leu Arg Gin Cys Leu Arg Ser Leu Gly Gly Tyr Val Leu 

35 40 45 

Ser Leu Val His Asp Val Arg Phe His His Phe Pro lie Glu Arg Gin 

50 55 60 

Leu Asn Gly Thr Tyr Ala He Ala Gly Gly Lys Ala His Cys Gly Pro 
65 70 75 80 

Ala Glu Leu Cys Glu Phe Tyr Ser Arg Asp Pro Asp Gly Leu Pro Cys 

85 90 95 

Asn Leu Arg Lys Pro Cys Asn Arg Pro Ser Gly Leu Glu Pro Gin Pro 
25 100 105 110 

Gly Val Phe Asp Cys Leu Arg Asp Ala Met Val Arg Asp Tyr Val Arg 

115 120 125 

Gin Thr Trp Lys Leu Glu Gly Glu Ala Leu Glu Gin Ala He He Ser 

130 135 140 

Gin Ala Pro Gin Val Glu Lys Leu He Ala Thr Thr Ala His Glu Arg 
145 150 155 160 

Met Pro Trp Tyr His Ser Ser Leu Thr Arg Glu Glu Ala Glu Arg Lys 

165 170 175 

Leu Tyr Ser Gly Ala Gin Thr Asp Gly Lys Phe Leu Leu Arg Pro Arg 
35 180 185 190 

Lys Glu Gin Gly Thr Tyr Ala Leu Ser Leu He Tyr Gly Lys Thr Val 

195 200 205 

Tyr His Tyr Leu He Ser Gin Asp Lys Ala Gly Lys Tyr Cys He Pro 

210 215 220 

Glu Gly Thr Lys Phe Asp Thr Leu Trp Gin Leu Val Glu Tyr Leu Lys 
225 230 235 240 

Leu Lys Ala Asp Gly Leu He Tyr Cys Leu Lys Glu Ala Cys Pro Asn 

245 250 255 

Ser Ser Ala Ser Asn Ala Ser Gly Ala Ala Ala Pro Thr Leu Pro Ala 

260 265 270 

His Pro Ser Thr Leu Thr His Pro Gin Arg Arg He Asp Thr Leu Asn 

275 280 285 

Ser Asp Gly Tyr Thr Pro Glu Pro Ala Ara T^e Thr 9r>r 



55 



3 3 0 3 3 5 

:..GU Leu He Ala Asp He Glu Leu Gly Cys Gly Asn Phe Gly Ser Val 



34 0 



345 



350 



Arg Gin Gly Val Tyr Ara r-'et A 
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355 

Lys Val Leu Lys 
370 

Arg Glu Ala Gin 
5 385 

Leu lie Gly Val 

Ala Gly Gly Gly 

420 

10 lie Pro Val Ser 

435 

Met Lys Tyr Leu 
450 

Arg Asn Val Leu 
15 465 

Gly Leu Ser Lys 

Ser Ala Gly Lys 

500 

20 Phe Arg Lys Phe 

515 

Met Trp Glu Ala 
530 

Gly Pro Glu Val 
25 545 

Pro Pro Glu Cys 

He Tyr Lys Trp 

580 

30 Met Arg Ala Cys 

595 

Gly Ser Thr Gin 
610 

Ala Thr Met Val 
35 625 

He Leu Val Glu 

Ser Gly Glu Gly 

660 

40 Phe He Cys Thr 

675 

Thr Thr Leu Thr 
690 

Met Lys Gin His 
45 705 

Gin Glu Arg Thr 

Ala Glu Val Lys 



204 

360 



Gin 


Gly 


Thr 


Glu 


Lys 


Ala 






375 








He 


Met 


His 


Gin 


Leu 


Asp 




390 










Cys 


Gin 


Ala 


Glu 


Ala 


Leu 


405 










410 


Pro 


Leu 


His 


Lys 


Phe 


Leu 










425 




Asn 


Val 


Ala 


Glu 


Leu 


Leu 








440 






Glu 


Glu 


Lys 


Asn 


Phe 


Val 






455 








L^ u 




Asn 


Arg 


His 


Tyr 




470 










Ala 


Leu 


Gly Ala Asp 


Asp 


485 










490 


Trp 


Pro 


Leu 


Lys 


Trp 


Tyr 










505 




Ser 


Ser 


Arg 


Ser 


Asp 


Val 








520 






Leu 


Ser 


Tyr 


Gly 


Gin 


Lys 






535 








Met 


Ala 


Phe 


He 


Glu 


Gin 




550 










Pro 


Pro 


Glu 


Leu 


Tyr 


Ala 


565 










570 


-> 


Asp 


Arg 


Pro 


Asp 


Phe 










585 




Tyr 


Tyr 


Ser 


Leu 


Ala 


Ser 








600 






Lys 


Ala 


Glu 


Ala 


Ala 


Cys 






615 








Ser 


Lys 


Gly Glu 


Glu 


Leu 




630 










Leu 


Asp 


Gly Asp 


Val 


Asn 


645 










650 


Glu 


Gly 


Asp 


Ala 


Thr 


Tyr 










665 




Thr 


Gly 


Lys 


Leu 


Pro 


Val 








680 






Tyr 


Gly 


Val 


Gin 


Cys 


Phe 






695 








Asp 


Phe 


Phe 


Lys 


Ser 


Ala 




710 










He 


Phe 


Phe 


Lys 


Asp 


Asp 


725 










730 


Phe 


Glu 


Gly 


Asp 


Thr 


Leu 



365 

Asp Thr Glu Glu Met Met 
380 

Asn Pro Tyr He Val Arg 
395 400 
Met Leu Val Met Glu Met 

415 

Val Gly Lys Arg Glu Glu 

430 

His Gin Val Ser Met Gly 
445 

His Arg Asp Leu Ala Ala 
460 

Ala Lys He Ser Asp Phe 
475 480 
Ser Tyr Tyr Thr Ala Arg 

495 

Ala Pro Glu Cys He Asn 

510 

Trp Ser Tyr Gly Val Thr 
525 

Pro Tyr Lys Lys Met Lys 
540 

Gly Lys Arg Met Glu Cys 
555 560 
Leu Met Ser Asp Cys Trp 

Leu Thr Val .Glu Gin Arg 

590 

Lys Val Glu Gly Pro Pro 
605 

Ala Trp Asp Pro Pro Val 
620 

Phe Thr Gly Val Val Pro 
635 640 

Gly His Lys Phe Ser Val 

655 

Gly Lys Leu Thr Leu Lys 

670 

Pro Trp Pro Thr Leu Val 
685 

Ser Arg Tyr Pro Asp His 
700 

Met Pro Glu Gly Tyr Val 
715 720 
Gly Asn Tyr Lys Thr Arg 

735 

Val A'=;". A^-T ^"-^ ' 



Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu As 



78 C 

?i cr n ^ 1 D r:- 1 • 1 ^ p 



55 785 790 795 800 

Gly Ser Val Gin Leu Ala Am Hi- ^ - •: r ■■ ■ - ■ ' 
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10 



25 



45 



205 

805 810 815 

Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser 

820 825 830 

Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu 

835 840 845 

Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr 
850 855 860 

Lys 
865 

(2) INFORMATION FOR SEQ ID NO: 112: 



(i) SEQUENCE CHARACTERISTICS: 
{A} LENGTH: 163 5 base pairs 

15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
20 (ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1632 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 



ATG GAG AAC TTC CAA AAG GTG GAA AAG ATC GGA GAG GGC ACG TAG GGA 4 8 

Met Glu Asn Phe Gin Lys Val Glu Lys He Gly Glu Gly Thr Tyr Gly 
30 1 5 10 15 

GTT GTG TAC AAA GCC AGA AAC AAG TTG ACG GGA GAG GTG GTG GCG CTT 96 

Val Val Tyr Lys Ala Arg Asn Lys Leu Thr Gly Glu Val Val Ala Leu 

20 25 30 

35 

AAG AAA ATC CGC CTG GAC ACT GAG ACT GAG GGT GTG CCC AGT ACT GCC 144 

Lys Lys He Arg Leu Asp Thr Glu Thr Glu Gly Val Pro Ser Thr Ala 

35 40 45 

40 ATC CGA GAG ATC TCT CTG CTT AAG GAG CTT AAC CAT CCT AAT ATT GTC 192 

He Arg Glu He Ser Leu Leu Lys Glu Leu Asn His Pro Asn He Val 
50 55 60 



AAG CTG CTG GAT GTC ATT CAC ACA GAA AAT AAA CTC TAC CTG GTT TTT 24 0 

Lys Leu Leu Asp Val He His Thr Glu Asn Lys Leu Tyr Leu Val Phe 
65 70 75 BO 



GAA TTT CTG CAC CAA GAT CTC AAG AAA TTC A"^^ HA'^ ^"^-^ 



100 



I'^e L'/s Ser T\'r l..eu Phe G^r. Leu Leu 
10 5 110 



55 



.Mo 



^ v- L 1 A C 7 L - I ... . ■ .... ;./■.. 
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Gin Gly Leu Ala Phe Cys His Ser His Arg Val Leu His Arg Asp Leu 
115 120 125 

AAA CCT GAG AAT CTG CTT ATT AAC ACA GAG GGG GCC ATC AAG CTA GCA 4 32 

5 Lys Pro Gin Asn Leu Leu He Asn Thr Glu Gly Ala He Lys Leu Ala 
130 135 140 

GAC TTT GGA CTA GCC AGA GCT TTT GGA GTC CCT GTT CGT ACT TAG ACC 4 80 

Asp Phe Gly Leu Ala Arg Ala Phe Gly Val Pro Val Arg Thr Tyr Thr 
10 145 150 155 160 

CAT GAG GTG GTG ACC CTG TGG TAC CGA GCT CCT GAA ATC CTC CTG GGC 52 8 

His Glu Val Val Thr Leu Trp Tyr Arg Ala Pro Glu He Leu T,e" Gly 

165 170 175 

15 

TCG AAA TAT TAT TCC ACA GCT GTG GAC ATC TGG AGC CTG GGC TGC ATC 5 76 

Ser Lys Tyr Tyr Ser Thr Ala Val Asp lie Trp Ser Leu Gly Cys He 

180 185 190 

20 TTT GCT GAG ATG GTG ACT CGC CGG GCC CTG TTC CCT GGA GAT TCT GAG 624 
Phe Ala Glu Met Val Thr Arg Arg Ala Leu Phe Pro Gly Asp Ser Glu 
195 200 205 

ATT GAC CAG CTC TTC CGG ATC TTT CGG ACT CTG GGG ACC CCA GAT GAG 67 2 

25 He Asp Gin Leu Phe Arg He Phe Arg Thr Leu Gly Thr Pro Asp Glu 
210 215 220 

GTG GTG TGG CCA GGA GTT ACT TCT ATG CCT GAT TAC AAG CCA AGT TTC 72 0 

Val Val Trp Pro Gly Val Thr Ser Met Pro Asp Tyr Lys Pro Ser Phe 
30 225 230 235 240 

CCC AAG TGG GCC CGG CAA GAT TTT AGT AAA GTT GTA CCT CCC CTG GAT 768 
Pro Lys Trp Ala Arg Gin Asp Phe Ser Lys Val Val Pro Pro Leu Asp 

245 250 255 

35 

GAA GAT GGA CGG AGC TTG TTA TCG CAA ATG CTG CAC TAC GAC CCT AAC 816 
Glu Asp Gly Arg Ser Leu Leu Ser Gin Met Leu His Tyr Asp Pro Asn 

260 265 270 

40 AAG CGG ATT TCG GCC AAG GCA GCC CTG GCT CAC CCT TTC TTC CAG GAT 864 
Lys Arg He Ser Ala Lys Ala Ala Leu Ala His Pro Phe Phe Gin Asp 
275 280 285 



45 



GTG ACC AAG CCA GTA CCC CAT CTT CGA CTC TGG GAT CCA CCG GTC GCC 912 
Val Thr Lys Pro Val Pro His Leu Arg Leu Trp Asp Pro Pro Val Ala 
290 295 300 



ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC Hn^- ^t- 



55 



325 

^ ^ /—\ ^ ^ ^ i-p, 

■^^O^.. OOv, v-J 1 



Asp 'j'a^ Asn Gly His Lys Phe Ser Val Ser 

330 335 
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Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe 

340 345 350 

ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC 1104 
5 lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr 

355 360 365 

ACC CTG ACC TAG GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG 1152 
Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met 
10 370 375 380 

AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG 12 0 0 

Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin 
385 390 395 400 

15 

GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC 124 8 
Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala 

405 410 415 

20 GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG 12 96 

Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys 

420 425 430 

GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG 1344 
25 Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu 

435 440 445 

TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG 13 92 
Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys 
30 450 455 460 

AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC 144 0 

Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly 

465 470 475 480 

35 

AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC 14 8 8 

Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp 

485 490 495 

40 GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC 15 3 6 

Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala 

500 505 510 

CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG 15 8 4 
45 Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu 

515 520 525 



TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAH TT^^ 



55 



i:;format:o: 



FOR 



SEQ 13 NO : 1 13 



SEQUENCE ^HAPAO^FP. TFT' 



SUBSTITUTE SHEET (RULE 26) 
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10 



45 



208 

(A) LENGTH: 544 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 



Met Glu Asn Phe Gin Lys Val Glu Lys lie Gly Glu Gly Thr Tyr Gly 

15 10 15 

Val Val Tyr Lys Ala Arg Asn Lys Leu Thr Gly Glu Val Val Alfi Leu 

20 25 30 

15 Lys Lys lie Arg Leu Asp Thr Glu Thr Glu Gly Val Pro Ser Thr Ala 

35 40 45 

lie Arg Glu lie Ser Leu Leu Lys Glu Leu Asn His Pro Asn lie Val 

50 55 60 

Lys Leu Leu Asp Val lie His Thr Glu Asn Lys Leu Tyr Leu Val Phe 
20 65 70 75 80 

Glu Phe Leu His Gin Asp Leu Lys Lys Phe Met Asp Ala Ser Ala Leu 

85 90 95 

Thr Gly lie Pro Leu Pro Leu lie Lys Ser Tyr Leu Phe Gin Leu Leu 

100 105 110 

25 Gin Gly Leu Ala Phe Cys His Ser His Arg Val Leu His Arg Asp Leu 

115 120 125 

Lys Pro Gin Asn Leu Leu He Asn Thr Glu Gly Ala He Lys Leu Ala 

130 135 140 

Asp Phe Gly Leu Ala Arg Ala Phe Gly Val Pro Val Arg Thr Tyr Thr 
30 145 150 155 160 

His Glu Val Val Thr Leu Trp Tyr Arg Ala Pro Glu He Leu Leu Gly 

165 170 175 

Ser Lys Tyr Tyr Ser Thr Ala Val Asp He Trp Ser Leu Gly Cys He 

180 185 190 

35 Phe Ala Glu Met Val Thr Arg Arg Ala Leu Phe Pro Gly Asp Ser Glu 

195 200 205 

He Asp Gin Leu Phe Arg He Phe Arg Thr Leu Gly Thr Pro Asp Glu 

210 215 220 

Val Val Trp Pro Gly Val Thr Ser Met Pro Asp Tyr Lys Pro Ser Phe 
40 225 230 235 240 

Pro Lys Trp Ala Arg Gin Asp Phe Ser Lys Val Val Pro Pro Leu Asp 

245 250 255 

Glu Asp Gly Arg Ser Leu Leu Ser Gin Met Leu His Tyr Asp Pro Asn 

260 265 270 

Lys Arg He Ser Ala Lys Ala Ala Leu Ala His Pro Phe Phe Gin Asp 

275 280 285 

Val Thr Lys Pro Val Pro His Leu Arg Leu Trp Asp Pro Pro Val Ala 
290 295 300 



55 



- ■■ ; /^..a In: "lyr G^y Ly^ Leu Thr Leu Lys Phe 
340 345 35 0 

He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr 
355 160 
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Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met 

370 375 380 

Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin 
385 390 395 400 

5 Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala 

405 410 415 

Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys 

420 425 430 

Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu 
10 435 440 445 

Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys 

450 455 460 

Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly 
465 470 475 480 

15 Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp 

485 490 495 

Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala 

500 505 510 

Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu 
20 515 520 525 

Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
530 535 540 



25 



40 



(2) INFORMATION FOR SEQ ID NO: 114 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1535 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

35 (A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1632 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

45 GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 9 6 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 



55 



Cyc 

50 



.I ^ y 



ys L-eu Pro Val 
55 



ro Trp Pro Thr 
60 



Leu Val Thr Thr 
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CTG ACC TAG GGC GTG CAG TGC TTC AGC CGC TAG CCC GAG GAG ATG AAG 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

5 GAG GAG GAG TTC TTC AAG TCC GGC ATG CGC GAA GGC TAG GTC CAG GAG 28 8 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

CGC ACC ATG TTC TTC AAG GAC GAC GGC AAG TAG AAG ACC GGC GGC GAG 336 
10 Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAG CGC ATC GAG CTG AAG HGC 2S4 
Val Lys ?he Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
15 115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAG AAG CTG GAG TAG 4 32 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

20 

AAC TAG AAC AGC GAC AAC GTG TAT ATC ATG GGC GAC AAG CAG AAG AAC 4 80 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

25 GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

GTG CAG GTC GGC GAC CAC TAG CAG CAG AAC ACC CCC ATC GGC GAC GGC 5 76 

30 Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAG TAG CTG AGC ACC CAG TCC GGC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
35 195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 6 72 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

40 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAG AAG TCC 7 20 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

45 GGA CTC AGA TCT CGA GCC ATG GAG AAC TTC GAA AAG GTG GAA AAG ATC 768 
Gly Leu Arg Ser Arg Ala Met Glu Asn Phe Gin Lys Val Glu Lys He 

245 250 255 



^ 1 \' G ^ u V d. 1 V d. X Ala L e u L'v 
55 275 



Lys lie Arg 
280 



Asp Thr Glu Thr Glu 
285 



a b ^ 
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GGT GTG CCC AGT ACT GCC ATC CGA GAG ATC TCT CTG CTT AAG GAG CTT 912 

Gly Val Pro Ser Thr Ala lie Arg Glu lie Ser Leu Leu Lys Glu Leu 

290 295 300 

5 AAC CAT CCT AAT ATT GTC AAG CTG CTG GAT GTC ATT CAC ACA GAA AAT 960 

Asn His Pro Asn lie Val Lys Leu Leu Asp Val lie His Thr Glu Asn 

305 310 315 320 

AAA CTC TAC CTG GTT TTT GAA TTT CTG CAC CAA GAT CTC AAG AAA TTC 1008 

10 Lys Leu Tyr Leu Val Phe Glu Phe Leu His Gin Asp Leu Lys Lys Phe 

325 330 335 

ATG GAT GCC TCT GCT CTC ACT GGC ATT CCT CTT CCC CTC ATC AAG AGC 105 6 

Met Ann Ala Ser Ala Leu Thr Gly lie Pro Leu Pro Leu lie Lys Ser 

15 340 345 350 

TAT CTG TTC CAG CTG CTC CAG GGC CTA GCT TTC TGC CAT TCT CAT CGG 1104 

Tyr Leu Phe Gin Leu Leu Gin Gly Leu Ala Phe Cys His Ser His Arg 
355 360 365 

20 

GTC CTC CAC CGA GAC CTT AAA CCT CAG AAT CTG CTT ATT AAC ACA GAG 1152 

Val Leu His Arg Asp Leu Lys Pro Gin Asn Leu Leu He Asn Thr Glu 

370 375 380 

25 GGG GCC ATC AAG CTA GCA GAC TTT GGA CTA GCC AGA GCT TTT GGA GTC 12 00 

Gly Ala He Lys Leu Ala Asp Phe Gly Leu Ala Arg Ala Phe Gly Val 

385 390 395 400 

CCT GTT CGT ACT TAC ACC CAT GAG GTG GTG ACC CTG TGG TAC CGA GCT 1248 

30 Pro Val Arg Thr Tyr Thr His Glu Val Val Thr Leu Trp Tyr Arg Ala 

405 410 415 

CCT GAA ATC CTC CTG GGC TCG AAA TAT TAT TCC ACA GCT GTG GAC ATC 12 96 

Pro Glu He Leu Leu Gly Ser Lys Tyr Tyr Ser Thr Ala Val Asp He 

35 420 425 430 

TGG AGC CTG GGC TGC ATC TTT GCT GAG ATG GTG ACT CGC CGG GCC CTG 13 44 

Trp Ser Leu Gly Cys He Phe Ala Glu Met Val Thr Arg Arg Ala Leu 
435 440 445 

40 

TTC CCT GGA GAT TCT GAG ATT GAC CAG CTC TTC CGG ATC TTT CGG ACT 13 92 

Phe Pro Gly Asp Ser Glu He Asp Gin Leu Phe Arg He Phe Arg Thr 

450 455 460 

45 CTG GGG ACC CCA GAT GAG GTG GTG TGG CCA GGA GTT ACT TCT ATG CCT 14 4 0 

Leu Gly Thr Pro Asp Glu Val Val Trp Pro Gly Val Thr Ser Met Pro 

465 470 475 480 



Val Val Pre Pre Leu Asp Giu Asp Gly Arg Ser Leu Leu Ser Gin Met 
55 500 505 510 
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CTG CAC TAG GAG CCT AAC AAG CGG ATT TCG GCC AAG GCA GCC CTG GCT 15 8 4 

Leu His Tyr Asp Pro Asn Lys Arg lie Ser Ala Lys Ala Ala Leu Ala 
515 520 525 

CAC CCT TTC TTC CAG GAT GTG ACC AAG CCA GTA CCC CAT CTT CGA CTC T 163 3 
His Pro Phe Phe Gin Asp Val Thr Lys Pro Val Pro His Leu Arg Leu 
530 535 540 

GA 1635 
(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 544 ammo acids 

15 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
20 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
25 1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

30 Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
35 85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

40 He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
45 165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Lgu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Per Ala : 



J 3 



24 C 

Giy Le'j. Arg Ser Arg Ala Met Glu Asn Phe Gin Lys Val Glu Lys He 
55 245 250 255 

Glv Glu Glv Thr Tvr G^v Val Val ^'/r" !.-.'<^ " ^ -"'^ ' 
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260 265 270 

Gly Glu Val Val Ala Leu Lys Lys lie Arg Leu Asp Thr Glu Thr Glu 

275 280 285 

Gly Val Pro Ser Thr Ala lie Arg Glu He Ser Leu Leu Lys Glu Leu 

290 295 300 

Asn His Pro Asn He Val Lys Leu Leu Asp Val He His Thr Glu Asn 
305 310 315 320 

Lys Leu Tyr Leu Val Phe Glu Phe Leu His Gin Asp Leu Lys Lys Phe 

325 330 335 

Met Asp Ala Ser Ala Leu Thr Gly lie Pro Leu Pro Leu He Lys Ser 

340 345 350 

Tyr Leu Phe Gin Leu Leu Gin Gly Leu Ala Phe Cys His Ser His Arg 

355 360 365 

Val Leu His Arg Asp Leu Lys Fro Gin Asn Leu Leu He Asn Thr Glu 
15 370 375 380 

Gly Ala He Lys Leu Ala Asp Phe Gly Leu Ala Arg Ala Phe Gly Val 
385 390 395 400 

Pro Val Arg Thr Tyr Thr His Glu Val Val Thr Leu Trp Tyr Arg Ala 

405 410 415 

20 Pro Glu He Leu Leu Gly Ser Lys Tyr Tyr Ser Thr Ala Val Asp He 

420 425 430 

Trp Ser Leu Gly Cys He Phe Ala Glu Met Val Thr Arg Arg Ala Leu 

435 440 445 

Phe Pro Gly Asp Ser Glu He Asp Gin Leu Phe Arg He Phe Arg Thr 
25 450 455 460 

Leu Gly Thr Pro Asp Glu Val Val Trp Pro Gly Val Thr Ser Met Pro 
465 470 475 480 

Asp Tyr Lys Pro Ser Phe Pro Lys Trp Ala Arg Gin Asp Phe Ser Lys 

485 490 495 

30 Val Val Pro Pro Leu Asp Glu Asp Gly Arg Ser Leu Leu Ser Gin Met 

500 505 510 

Leu His Tyr Asp Pro Asn Lys Arg He Ser Ala Lys Ala Ala Leu Ala 

515 520 525 

His Pro Phe Phe Gin Asp Val Thr Lys Pro Val Pro His Leu Arg Leu 
35 530 535 540 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 2532 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 



ATrj G7C AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4Q 
55 Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pre He Leu 
1 5 ic 
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30 



35 



40 



45 
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GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTP, CAG TGC TTC AGC CUC TAC CCC GAC CAC ATG AAG 24 0 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

70 75 80 



CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

20 B5 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 33 6 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 



GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 4 80 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 X75 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 5 76 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 6 24 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 



5^^ 



;g 



GGC ATG GAG GAG CTG TAC AAG TC; 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tvr Lys Se; 
225 230 



Tin; 
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GGA CTC AGA TCT CGA GAG ATG CTG TCC CGT GGG TGG TTT CAC CGA GAC 76 8 

Gly Leu Arg Ser Arg Glu Met Leu Ser Arg Gly Trp Phe His Arg Asp 

245 250 255 

CTC AGT GGG CTG GAT GCA GAG ACC CTG CTC AAG GGC CGA GGT GTC CAC 816 

Leu Ser Gly Leu Asp Ala Glu Thr Leu Leu Lys Gly Arg Gly Val His 

260 265 270 

GGT AGC TTC CTG GCT CGG CCC AGT CGC AAG AAC CAG GGT GAC TTC TCG 864 

Gly Ser Phe Leu Ala Arg Pro Ser Arg Lys Asn Gin Gly Asp Phe Ser 
275 280 285 

CTC TCC GTC AGG QTG GGG GAT CAG GTG ACC CAT ATT CGG ATC CAG AAC 912 

Leu Ser Val Arg Val Gly Asp Gin Val Thr His lie Arg He Gin Asn 
290 295 300 



TCA GGG GAT TTC TAT GAC CTG TAT GGA GGG GAG AAG TTT GCG ACT CTG 960 
Ser Gly Asp Phe Tyr Asp Leu Tyr Gly Gly Glu Lys Phe Ala Thr Leu 
20 305 310 315 320 

ACA GAG CTG GTG GAG TAG TAG ACT CAG CAG CAG GGT GTC CTG CAG GAC 100 8 
Thr Glu Leu Val Glu Tyr Tyr Thr Gin Gin Gin Gly Val Leu Gin Asp 

325 330 335 



CGC GAC GGC ACC ATC ATC CAC CTC AAG TAG CCG CTG AAC TGC TCC GAT 105 6 

Arg Asp Gly Thr He He His Leu Lys Tyr Pro Leu Asn Cys Ser Asp 

340 345 350 

CCC ACT AGT GAG AGG TGG TAG CAT GGC CAC ATG TCT GGC GGG CAG GCA 1104 

Pro Thr Ser Glu Arg Trp Tyr His Gly His Met Ser Gly Gly Gin Ala 

355 360 365 

GAG ACG CTG CTG CAG GCC AAG GGC GAG CCC TGG ACG TTT CTT GTG CGT 1152 

Glu Thr Leu Leu Gin Ala Lys Gly Glu Pro Trp Thr Phe Leu Val Arg 

370 375 380 



GAG AGC CTC AGC CAG CCT GGA GAC TTC GTG CTT TCT GTG CTC AGT GAC 12 00 

Glu Ser Leu Ser Gin Pro Gly Asp Phe Val Leu Ser Val Leu Ser Asp 
385 390 395 400 

CAG CCC AAG GCT GGC CCA GGC TCC CCG CTC AGG GTC ACC CAC ATC AAG 124 8 

Gin Pro Lys Ala Gly Pro Gly Ser Pro Leu Arg Val Thr His He Lys 

405 410 415 



GTC ATG TGC GAG GGT GGA CGC TAG ACA GTG GGT GGT TTG GAG ACC TTC 12 96 

Val Met Cys Glu Gly Gly Arg Tyr Thr Val Gly Gly Leu Glu Thr Phe 

420 425 430 



3 AG CCC TCA GGC GCC TTT GTC TAC CTG CGG CAG CCG TAC TAT GCC ACG 13 92 

55 Glu Ala Ser Gly Ala Phe Val Tyr Leu Arg Gin Pro Tyr Tyr Ala Thr 
450 455 4 f^" 
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AGG GTG AAT GCG GCT GAG ATT GAG AAC CGA GTG TTG GAA CTG AAC AAG 
Arg Val Asn Ala Ala Asp He Glu Asn Arg Val Leu Glu Leu Asn Lys 

470 475 



1440 



AAG GAG GAG TCC GAG GAT ACA GCC AAG GCT GGC TTC TGG GAG GAG TTT 1488 

Lys Gin Glu Ser Glu Asp Thr Ala Lys Ala Gly Phe Trp Glu Glu Phe 

485 490 495 

GAG AGT TTG CAG AAG CAG GAG GTG AAG AAC TTG CAC CAG CGT CTG GAA 1536 

Glu Ser Leu Gin Lys Gin Glu Val Lys Asn Leu His Gin Arg Leu Glu 

500 505 510 

GGG CAG CGG CPA GAG AAG GGC AAG AAC CGC TAG AAG AAC ATT CTC 1584 

Gly Gin Arg Pro Glu Asn Lys Gly Lys Asn Arg Tyr Lys Asn He Leu 

515 520 525 



CCC TTT GAC CAC AGC CGA GTG ATC CTG CAG GGA CGG GAC AGT AAC ATC 1632 
Pro Phe Asp His Ser Arg Val He Leu Gin Gly Arg Asp Ser Asn He 
20 530 535 

CCC GGG TCC GAC TAG ATC AAT GCC AAC TAG ATC AAG AAC CAG CTG CTA 16 8 0 

Pro Gly Ser Asp Tyr He Asn Ala Asn Tyr lie Lys Asn Gin Leu Leu 

550 555 560 



GGC CCT GAT GAG AAC GCT AAG ACC TAG ATC GCC AGC CAG GGC TGT CTG 172 8 

Gly Pro Asp Glu Asn Ala Lys Thr Tyr He Ala Ser Gin Gly Cys Leu 

570 575 

GAG GCC ACG GTC AAT GAC TTC TGG CAG ATG GCG TGG CAG GAG AAC AGC 177 6 

Glu Ala Thr Val Asn Asp Phe Trp Gin Met Ala Trp Gin Glu Asn Ser 

580 585 590 

CGT GTC ATC GTC ATG ACC ACC CGA GAG GTG GAG AAA GGC CGG AAC AAA 1824 
Arg Val He Val Met Thr Thr Arg Glu Val Glu Lys Gly Arg Asn Lys 
595 600 605 



TGC GTC CCA TAG TGG CCC GAG GTG GGC ATG CAG CGT GCT TAT GGG CCC 1872 

Cys Val Pro Tyr Trp Pro Glu Val Gly Met Gin Arg Ala Tyr Gly Pro 
^0 eiO 615 620 

TAG TCT GTG ACC AAC TGC GGG GAG CAT GAC ACA ACC GAA TAG AAA CTC 192 0 

Tyr Ser Val Thr Asn Cys Gly Glu His Asp Thr Thr Glu Tyr Lys Leu 

^25 630 635 640 



CGT ACC TTA CAG GTC TCC CCG CTG GAC AAT GGA GAC CTG ATT CGG GAG 196 8 

Arg Thr Leu Gin Val Ser Pro Leu Asp Asn Gly Asp Leu He Arg Glu 

645 650 



^y'' ^'^'^'^ C-^- GTC CTC AGC TTC CTG GAC CAG ATC AAC CAG CGG CAG 2 0 64 

55 Clu Pro Gly Gly Val Leu Ser Phe Leu Asp Gin He Asn Gin Arg Glr. 

675 ^P- 
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GAA AGT CTG CCT CAC GCA GGG CCC ATC ATC GTG CAC TGC AGC GCC GGC 2112 
Glu Ser Leu Pro His Ala Gly Pro lie lie Val His Cys Ser Ala Gly 
690 695 700 

5 

ATC GGC CGC ACA GGC ACC ATC ATT GTC ATC GAC ATG CTC ATG GAG AAC 2160 
lie Gly Arg Thr Gly Thr lie He Val He Asp Met Leu Met Glu Aan 
705 710 715 720 

10 ATC TCC ACC AAG GGC CTG GAC TGT GAC ATT GAC ATC CAG AAG ACC ATC 2 2 08 
He Ser Thr Lys Gly Leu Asp Cys Asp He Asp He Gin Lys Thr He 

725 730 735 



35 



55 



y^j^^ tiUU UAG CGC TCG GGC ATG GTG CAG ACG GAG GCG CAG 2256 

15 Gin Met Val Arg Ala Gin Arg Ser Gly Met Val Gin Thr Glu Ala Gin 

740 745 750 

TAC AAG TTC ATC TAG GTG GCC ATC GCC CAG TTC ATT GAA ACC ACT AAG 2 3 04 

Tyr Lys Phe He Tyr Val Ala He Ala Gin Phe He Glu Thr Thr Lys 

20 755 760 765 

AAG AAG CTG GAG GTC CTG CAG TCG CAG AAG GGC CAG GAG TCG GAG TAC 2 3 52 

Lys Lys Leu Glu Val Leu Gin Ser Gin Lys Gly Gin Glu Ser Glu Tyr 
770 775 780 

25 

GGG AAC ATC ACC TAT CCC CCA GCC ATG AAG AAT GCC CAT GCC AAG GCC 2 4 00 

Gly Asn He Thr Tyr Pro Pro Ala Met Lys Asn Ala His Ala Lys Ala 
785 790 795 800 

30 TCC CGC ACC TCG TCC AAA CAC AAG GAG GAT GTG TAT GAG AAC CTG CAC 2 44 8 

Ser Arg Thr Ser Ser Lys His Lys Glu Asp Val Tyr Glu Asn Leu His 

805 810 815 



ACT AAG AAC AAG AGG GAG GAG AAA GTG AAG AAG CAG CGG TCA GCA GAC 2 4 96 

Thr Lys Asn Lys Arg Glu Glu Lys Val Lys Lys Gin Arg Ser Ala Asp 

820 825 830 



AAG GAG AAG AGC AAG GGT TCC CTC AAG AGG AAG TGA 25 3 2 

Lys Glu Lys Ser Lys Gly Ser Leu Lys Arg Lys 
40 835 840 



(2) INFORMATION FOR SEQ ID NO: 117 



45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : sinale 



;xi) SEQUENCE DESCRIPTION: SEQ ZD NO ; 11 7 
Met Val Ser L'/s Glv ^'i; n'-- - r.-- v^-n -^^-^ "-^ 
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vll Olu Leu ASP GXy ASP Val Asn Gly His Lys Phe Ser Val Se. Gly 
aiu Gly Glu G°iy ASP Ala Thr Tyr Gly Lys Le. Thr Leu Lys Phe He 
cys TKr tL Gly Lys Leu Pro "l Pro Trp Pro Thr Leu Val Thr Thr 
.eu tL Tyr Gly Val Gin C^s Phe Ser Ar^ Tyr Pro Asp His Met Lys 
, His ASP Phe Phe Zs ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

. _ IL T.,c A.n ASD Glv Asn Tyr Lys Thr Arg Ala Glu 
Arg Tnr xj-e -^"^ ■> - - - 110 

V.1 .ys P.. Ill Oly ... Th, L,. v.l 11. 31. ■■y^ 

^ Xle «^ ^V. =1" - ^i'o "'^ 

S.. Hi. «n ".1 Tvr II. «=t Al. ».P -V^ 

0 n. V.1 ^„ p" .ys n. hi. ... u. -p 3« 

V.1 =ln .eu «. Hl= TV. =1. Oln T.r «o 11. aly A=P Oly 
,.1 .eu L» pro «P »i= Tyr ser Thr Ola s,r Ma .eu 

25 _ _ ... ,.„. "° MD Hi. «.t val L.U C!l» Ph. 

ser Ly» A«P ^20 

V.1 Thr Ma Ala Oly n. Thr I.e>. Oly Met «p <=1. I- Tyr Ly. Ser 
30 ... Ar. ser Ar, III Met ... s.r Ar, .ly Trp Ph. Hi. Ar. 

,.r =ly ... a'.p Ala 01. Thr Le. L.. I-y- Ar, oly v.l Hi= 
„ly s.r Ph. Al. Arg Pro s.r Ar, .y. As. 01. oly Asp Ph. ser 
ser "al Ar, V.l Oly A.p "I V.l Thr „i. 11. Ar, II. 01. As. 

290 ^ ^ phe Ala Thr Leu 

ser Gly Asp Phe Tyr Asp Leu Tyr Gly Gly Glu Lys 

VI Sr Tvr Thr Gin Gin Gin Gly Val Leu Gin Asp 

40 Thr Glu Leu Val Glu Tyr lyr inr 

Ar, A.P Oly Thr l" 11. Hi. he. .y= Tyr Pro ... A.„ Cya S.r Asp 
pro Thr ser o" Ar, Trp Tyr His oly Hi. H.h s.r Oly Oly 01. Ala 
Olu Thr ... 01. Al. .ys IZ 01. pro Trp Thr Phe ... V.l Ar, 



v a 



1 Met cys Glu Gly Gly Arg Ty: 



fnr Va.i Giy ^-j-^: 
425 



430 



, „ 1 rin H^s Phe Lys Lys Thr Gly He Glu 
Asv Ser Leu Thr Asp Leu Val Glu H.s Pne i.y 
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20 



219 

450 455 460 

Arg Val Asn Ala Ala Asp He Glu Asn Arg Val Leu Glu Leu Asn Lys 
465 470 475 480 

Lys Gin Glu Ser Glu Asp Thr Ala Lys Ala Gly Phe Trp Glu Glu Phe 

485 490 495 

Glu Ser Leu Gin Lys Gin Glu Val Lys Asn Leu His Gin Arg Leu Glu 

500 505 510 

Gly Gin Arg Pro Glu Asn Lys Gly Lys Asn Arg Tyr Lys Asn He Leu 

515 520 525 

Pro Phe Asp His Ser Arg Val He Leu Gin Gly Arg Asp Ser Asn He 

530 535 540 

Pro Gly Ser Asp Tyr He Asn Ala Asn Tyr He Lys Asn Gin Leu Leu 
5^15 550 555 5CC 

Gly Pro Asp Glu Asn Ala Lys Thr Tyr He Ala Ser Gin Gly Cys Leu 
15 565 570 575 

Glu Ala Thr Val Asn Asp Phe Trp Gin Met Ala Trp Gin Glu Asn Ser 

580 585 590 

Arg Val He Val Met Thr Thr Arg Glu Val Glu Lys Gly Arg Asn Lys 

595 600 605 

Cys Val Pro Tyr Trp Pro Glu Val Gly Met Gin Arg Ala Tyr Gly Pro 

610 615 620 

Tyr Ser Val Thr Asn Cys Gly Glu His Asp Thr Thr Glu Tyr Lys Leu 
625 630 635 640 

Arg Thr Leu Gin Val Ser Pro Leu Asp Asn Gly Asp Leu He Arg Glu 
25 645 650 655 

He Trp His Tyr Gin Tyr Leu Ser Trp Pro Asp His Gly Val Pro Ser 

oou ooD 670 

Glu Pro Gly Gly Val Leu Ser Phe Leu Asp Gin He Asn Gin Arg Gin 

675 680 6B5 

Glu Ser Leu Pro His Ala Gly Pro He He Val His Cys Ser Ala Gly 

690 695 700 

He Gly Arg Thr Gly Thr He He Val He Asp Met Leu Met Glu Asn 
705 710 715 720 

He Ser Thr Lys Gly Leu Asp Cys Asp He Asp He Gin Lys Thr He 
35 725 730 735 

Gin Met Val Arg Ala Gin Arg Ser Gly Met Val Gin Thr Glu Ala Gin 

740 745 750 

Tyr Lys Phe He Tyr Val Ala He Ala Gin Phe He Glu Thr Thr Lys 

755 760 765 

Lys Lys Leu Glu Val Leu Gin Ser Gin Lys Gly Gin Glu Ser Glu Tyr 

770 775 780 

Gly Asn He Thr Tyr Pro Pro Ala Met Lys Asn Ala His Ala Lys Ala 
785 790 795 800 

Ser Arg Thr Ser Ser Lys His Lys Glu Asp Val Tyr Glu Asn Leu His 
45 805 810 815 

Thr Lys Asn Lys Arg Glu Glu Lys Val Lys Lys Gin Arg Ser Ala Asp 

820 825 830 



30 



40 



(i) SEQUENCE CHA^RACTERISTICS : 
(A) LENGTH: 2562 base pairs 
55 (B) TYPE: nucleic acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

5 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .2559 
(D) OTHER INFORMATION: 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

ATG CTG TCC CGT GGG TGG TTT CAC CGA GAC CTC AGT GGG CTG GAT GCA 4 8 

Met Leu Ser Arg Gly Trp Phe His Arg Asp Leu Ser Gly Len Asp Ala 

15 10 15 

15 

GAG ACC CTG CTC AAG GGC CGA GGT GTC CAC GGT AGC TTC CTG GCT CGG 96 

Glu Thr Leu Leu Lys Gly Arg Gly Val His Gly Ser Phe Leu Ala Arg 

20 25 30 

20 CCC AGT CGC AAG AAC CAG GGT GAC TTC TCG CTC TCC GTC AGG GTG GGG 144 
Pro Ser Arg Lys Asn Gin Gly Asp Phe Ser Leu Ser Val Arg Val Gly 
35 40 45 

GAT CAG GTG ACC CAT ATT CGG ATC CAG AAC TCA GGG GAT TTC TAT GAC 192 
25 Asp Gin Val Thr His lie Arg He Gin Asn Ser Gly Asp Phe Tyr Asp 
50 55 60 

CTG TAT GGA GGG GAG AAG TTT GCG ACT CTG ACA GAG CTG GTG GAG TAC 24 0 

Leu Tyr Gly Gly Glu Lys Phe Ala Thr Leu Thr Glu Leu Val Glu Tyr 
30 65 70 75 80 

TAC ACT CAG CAG CAG GGT GTC CTG CAG GAC CGC GAC GGC ACC ATC ATC 288 

Tyr Thr Gin Gin Gin Gly Val Leu Gin Asp Arg Asp Gly Thr He He 

85 90 95 

35 

CAC CTC AAG TAC CCG CTG AAC TGC TCC GAT CCC ACT AGT GAG AGG TGG 336 

His Leu Lys Tyr Pro Leu Asn Cys Ser Asp Pro Thr Ser Glu Arg Trp 

100 105 110 

40 TAC CAT GGC CAC ATG TCT GGC GGG CAG GCA GAG ACG CTG CTG CAG GCC 3 84 

Tyr His Gly His Met Ser Gly Gly Gin Ala Glu Thr Leu Leu Gin Ala 
115 120 125 



45 



AAG GGC GAG CCC TGG ACG TTT CTT GTG CGT GAG AGC CTC AGC CAG CCT 4 32 

Lys Gly Glu Pro Trp Thr Phe Leu Val Arg Glu Ser Leu Ser Gin Pro 

130 135 140 

GGA GAC TTC GTG CTT TCT GTG CTC AG'" ^^A^ ^A^- ^'^^ t.t,^. -^^^ 



.Gu Arg 
165 



"::r His lie Lys 

170 



Gl 



^ -J- y Lj J. 
1 '/ b 
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Arg Tyr Thr Val Gly Gly Leu Glu Thr Phe Asp Ser Leu Thr Asp Leu 

180 185 190 

GTA GAG CAT TTC AAG AAG ACG GGG ATT GAG GAG GCC TCA GGC GCC TTT 624 
5 Val Glu His Phe Lys Lys Thr Gly lie Glu Glu Ala Ser Gly Ala Phe 

195 200 205 

GTC TAG CTG CGG CAG CCG TAG TAT GCC ACG AGG GTG AAT GCG GCT GAC 672 
Val Tyr Leu Arg Gin Pro Tyr Tyr Ala Thr Arg Val Asn Ala Ala Asp 
10 210 215 220 

ATT GAG AAC CGA GTG TTG GAA CTG AAC AAG AAG CAG GAG TCC GAG GAT 72 0 

lie Glu Asn Arq Val Leu Glu Leu Asn Lvs Lvs Gin CAu Srt- niii Arti 
225 230 235 240 

15 

ACA GCC AAG GCT GGC TTC TGG GAG GAG TTT GAG ACT TTG CAG AAG CAG 768 

Thr Ala Lys Ala Gly Phe Trp Glu Glu Phe Glu Ser Leu Gin Lys Gin 

245 250 255 

20 GAG GTG AAG AAC TTG CAC CAG CGT CTG GAA GGG CAG CGG CCA GAG AAC 816 
Glu Val Lys Asn Leu His Gin Arg Leu Glu Gly Gin Arg Pro Glu Asn 

260 265 270 

AAG GGC AAG AAC CGC TAG AAG AAC ATT CTC CCC TTT GAC CAC AGC CGA 8 64 

25 Lys Gly Lys Asn Arg Tyr Lys Asn lie Leu Pro Phe Asp His Ser Arg 

275 280 2B5 

GTG ATC CTG CAG GGA CGG GAC AGT AAC ATC CCC GGG TCC GAC TAC ATC 912 
Val He Leu Gin Gly Arg Asp Ser Asn He Pro Gly Ser Asp Tyr lie 
30 290 295 300 

AAT GCC AAC TAC ATC AAG AAC CAG CTG CTA GGC CCT GAT GAG AAC GCT 960 

Asn Ala Asn Tyr He Lys Asn Gin Leu Leu Gly Pro Asp Glu Asn Ala 
305 310 315 320 

35 

AAG ACC TAC ATC GCC AGC CAG GGC TGT CTG GAG GCC ACG GTC AAT GAC 10 0 8 

Lys Thr Tyr He Ala Ser Gin Gly Cys Leu Glu Ala Thr Val Asn Asp 

325 330 335 

40 TTC TGG CAG ATG GCG TGG CAG GAG AAC AGC CGT GTC ATC GTC ATG ACC 10 5 6 

Phe Trp Gin Met Ala Trp Gin Glu Asn Ser Arg Val He Val Met Thr 

340 345 350 

ACC CGA GAG GTG GAG AAA GGC CGG AAC AAA TGC GTC CCA TAC TGG CCC 1104 
45 Thr Arg Glu Val Glu Lys Gly Arg Asn Lys Cys Val Pro Tyr Trp Pro 

355 360 365 

^T^f^ l^^^r^ J\'^^ '^^'^ O'^T' r^T^ fn^F-T-l ^ ^ ^ r-f^ ^r^^ py,^^ 



55 



Gly Glu His Aso Thr Thr Glu Tvr Gvs Leu Arc: Thr Leu Gin Va;. Ser 
385 390 395 400 
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Pro Leu Asp Asn Gly Asp Leu He Arg Glu He Trp His Tyr Gin Tyr 

405 410 415 

CTG AGC TGG CCC GAG CAT GGG GTC CCC AGT GAG OCT GGG GGT GTC CTC 12 96 

5 Leu Ser Trp Pro Asp His Gly Val Pro Ser Glu Pro Gly Gly Val Leu 

420 425 430 

AGC TTC CTG GAC CAG ATC AAC CAG CGG CAG GAA AGT CTG CCT CAC GCA 13 44 

Ser Phe Leu Asp Gin He Asn Gin Arg Gin Glu Ser Leu Pro His Ala 

10 435 440 445 

GGG CCC ATC ATC GTG CAC TGC AGC GCC GGC ATC GGC CGC ACA GGC ACC 13 92 

Glv Prn T]e Tip Val His Cvs Spr Al^ Glv Tlf^ nl v A-rn Thr- nl v Th-r 

J J — J — J — J — J 

450 455 460 

15 

ATC ATT GTC ATC GAC ATG CTC ATG GAG AAC ATC TCC ACC AAG GGC CTG 144 0 

He He Val He Asp Met Leu Met Glu Asn He Ser Thr Lys Gly Leu 

465 470 475 480 

20 GAC TGT GAC ATT GAC ATC CAG AAG ACC ATC CAG ATG GTG CGG GCG CAG 14 8 8 

Asp Cys Asp He Asp He Gin Lys Thr He Gin Met Val Arg Ala Gin 

485 490 495 

CGC TCG GGC ATG GTG CAG ACG GAG GCG CAG TAC AAG TTC ATC TAC GTG 153 6 

25 Arg Ser Gly Met Val Gin Thr Glu Ala Gin Tyr Lys Phe He Tyr Val 

500 505 510 

GCC ATC GCC CAG TTC ATT GAA ACC ACT AAG AAG AAG CTG GAG GTC CTG 15 8 4 

Ala He Ala Gin Phe He Glu Thr Thr Lys Lys Lys Leu Glu Val Leu 

30 515 520 525 

CAG TCG CAG AAG GGC CAG GAG TCG GAG TAC GGG AAC ATC ACC TAT CCC 163 2 

Gin Ser Gin Lys Gly Gin Glu Ser Glu Tyr Gly Asn He Thr Tyr Pro 

530 535 540 

35 

CCA GCC ATG AAG AAT GCC CAT GCC AAG GCC TCC CGC ACC TCG TCC AAA 16 8 0 

Pro Ala Met Lys Asn Ala His Ala Lys Ala Ser Arg Thr Ser Ser Lys 

545 550 555 560 

40 CAC AAG GAG GAT GTG TAT GAG AAC CTG CAC ACT AAG AAC AAG AGG GAG 17 2 8 

His Lys Glu Asp Val Tyr Glu Asn Leu His Thr Lya Asn Lys Arg Glu 

565 570 575 

GAG AAA GTG AAG AAG CAG CGG TCA GCA GAC AAG GAG AAG AGC AAG GGT 17 7 6 

45 Glu Lys Val Lys Lys Gin Arg Ser Ala Asp Lys Glu Lys Ser Lys Gly 

580 585 590 



55 



Agp Pro Pro Val Ala Tnr Met Val Ser Lvg Gl 
610 615 



Gxu Glu Leu Phe Th: 
620 
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Gly Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 
625 630 635 640 

AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAG GGC AAG 1968 
5 Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 

645 650 655 

CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG 2 016 
Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 
10 660 665 670 

CCC ACC CTC GTG ACC ACC CTG ACC TAG GGC GTG CAG TGC TTC AGC CGC 2064 

Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arq 

675 680 685 

15 

TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC 2112 

Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro 

690 695 700 

20 GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC 216 0 
Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn 
705 710 715 720 

TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC 2 2 0.8 
25 Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 

725 730 735 

CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG 2 256 
Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu 
30 740 745 750 

GGG CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG 2 3 04 

Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met 

755 760 765 

35 

GCC GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC 2 3 52 

Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His 

770 775 780 

40 AAC ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC 2 400 
Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn 
785 790 795 800 

ACC CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG 244 8 
45 Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu 

805 810 815 

AGC ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT r^A^ -^^.Of. 



i...HJ - . l^-l 



r .X s A* ... a, G \' ^r.r i^eii 



83 5 



34 0 



645 



55 
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Asp Glu Leu Tyr Lys 
850 



5 (2) INFORMATION FOR SEQ ID NO: 119 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 53 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119 



Met Leu Ser Arg Gly Trp Phe His Arg Asp Leu Ser Gly Leu Asp Ala 
15 10 15 

20 Glu Thr Leu Leu Lys Gly Arg Gly Val His Gly Ser Phe Leu Ala Arg 

20 25 30 

Pro Ser Arg Lys Asn Gin Gly Asp Phe Ser Leu Ser Val Arg Val Gly 

35 40 45 

Asp Gin Val Thr His lie Arg lie Gin Asn Ser Gly Asp Phe Tyr Asp 
25 50 55 60 

Leu Tyr Gly Gly Glu Lys Phe Ala Thr Leu Thr Glu Leu Val Glu Tyr 
65 70 75 80 

Tyr Thr Gin Gin Gin Gly Val Leu Gin Asp Arg Asp Gly Thr He He 

85 90 95 

30 His Leu Lys Tyr Pro Leu Asn Cys Ser Asp Pro Thr Ser Glu Arg Trp 

100 105 110 

Tyr His Gly His Met Ser Gly Gly Gin Ala Glu Thr Leu Leu Gin Ala 

115 120 125 

Lys Gly Glu Pro Trp Thr Phe Leu Val Arg Glu Ser Leu Ser Gin Pro 
35 130 135 140 

Gly Asp Phe Val Leu Ser Val Leu Ser Asp Gin Pro Lys Ala Gly Pro 
145 150 155 160 

Gly Ser Pro Leu Arg Val Thr His He Lye Val Met Cys Glu Gly Gly 

165 170 175 

40 Arg Tyr Thr Val Gly Gly Leu Glu Thr Phe Asp Ser Leu Thr Asp Leu 

180 185 190 

Val Glu His Phe Lys Lys Thr Gly He Glu Glu Ala Ser Gly Ala Phe 

195 200 205 

Val Tyr Leu Arg Gin Pro Tyr Tyr Ala Thr Arg Val Asn Ala Ala Asp 
45 210 215 220 

He Glu Asn Arg Val Leu Glu Leu Asn Lys Lys Gin Glu Ser Glu Asp 
225 230 235 240 

'yV^y A"*:^ n'^T,' PV*^ '^t---^ r-i ^ . , pl^(3 ^'--^ Cr-i-v- ^n - ^ • ' r~ ^ 



55 



275 

Val He Leu Gin Gly 
290 



280 

Arg Asp Ser Asn 
295 



285 

le Pro Gly Ser Asp 
3 00 



vr 



T 1 
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225 

305 310 315 320 

Lys Thr Tyr He Ala Ser Gin Gly Cys Leu Glu Ala Thr Val Asn Asp 

325 330 335 

Phe Trp Gin Met Ala Trp Gin Glu Asn Ser Arg Val He Val Met Thr 
5 340 345 350 

Thr Arg Glu Val Glu Lys Gly Arg Asn Lys Cys Val Pro Tyr Trp Pro 

355 360 365 

Glu Val Gly Met Gin Arg Ala Tyr Gly Pro Tyr Ser Val Thr Asn Cys 
370 375 380 

10 Gly Glu His Asp Thr Thr Glu Tyr Lys Leu Arg Thr Leu Gin Val Ser 
385 390 395 400 

Pro Leu Asp Asn Gly Asp Leu He Arg Glu He Trp His Tyr Gin Tyr 

405 410 415 

Leu Ser Trp Fro Asp His Gly Val Pro Ser Glu Pro Gly Gly Val Leu 
"•5 420 425 430 

Ser Phe Leu Asp Gin He Asn Gin Arg Gin Glu Ser Leu Pro His Ala 

435 440 445 

Gly Pro He He Val His Cys Ser Ala Gly He Gly Arg Thr Gly Thr 

450 455 460 

He He Val He Asp Met Leu Met Glu Asn He Ser Thr Lys Gly Leu 
465 470 475 480 

Asp Cys Asp He Asp He Gin Lys Thr He Gin Met Val Arg Ala Gin 

485 490 495 

Arg Ser Gly Met Val Gin Thr Glu Ala Gin Tyr Lys Phe He Tyr Val 
25 500 505 510 

Ala He Ala Gin Phe He Glu Thr Thr Lys Lys Lys Leu Glu Val Leu 

515 520 525 

Gin Ser Gin Lys Gly Gin Glu Ser Glu Tyr Gly Asn He Thr Tyr Pro 

530 535 540 

Pro Ala Met Lys Asn Ala His Ala Lys Ala Ser Arg Thr Ser Ser Lys 
545 550 555 560 

His Lys Glu Asp Val Tyr Glu Asn Leu His Thr Lys Asn Lys Arg Glu 

565 570 575 

Glu Lys Val Lys Lys Gin Arg Ser Ala Asp Lys Glu Lys Ser Lys Gly 
35 580 585 590 

Ser Leu Lys Arg Lys Arg He Leu Gin Ser Thr Val Pro Arg Ala Arg 

595 600 605 

Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr 
610 615 620 

40 Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 
625 630 635 640 

Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 

645 650 655 

Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 
45 660 665 670 

Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg 

675 680 685 

Tyr Pro Asp His Met Lys Gin Hi? Asn Ph*^ Ph^- L 



30 



55 



Arg He Giu Leu Lye 

740 



c Asp Phe 
745 



lys Glu Asp Gly Asn I^e Le 

750 



^1 Gl- 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 



PCT/DK98/00145 



10 



226 

755 760 765 

Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His 

770 775 780 

Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn 
785 790 795 800 

Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu 

805 810 815 

Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 

820 825 830 

Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met 

835 840 845 

Asp Glu Leu Tyr Lys 
850 

15 (2) INFORMATION FOR SEQ ID NO: 12 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 994 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; cDNA 
(ix) FEATURE: 

25 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION- 1...2 991 
(D) OTHER INFORMATION: 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 0: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
is 10 15 



35 



40 



45 



GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 9 6 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 14 4 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



55 



3in His Asp Phe Phe Ijvs Scr Ala Met: Pro Glu G.]y Tyr Val Gin Glu 

85 90 95 
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Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

5 Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 4 32 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
10 130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 4 80 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

15 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

20 GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
25 Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
30 210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 72 0 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 

225 230 235 240 

35 

GGA CTC AGA TCT CGA GCT CAA GCT TCG AAT TCG ACC ATG GAG CGG CCC 76 8 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Thr Met Glu Arg Pro 

245 250 255 

40 CCG GGG CTG CGG CCG GGC GCG GGC GGG CCC TGG GAG ATG CGG GAG CGG 816 
Pro Gly Leu Arg Pro Gly Ala Gly Gly Pro Trp Glu Met Arg Glu Arg 

260 265 270 

CTG GGC ACC GGC GGC TTC GGG AAC GTC TGT CTG TAC CAG CAT CGG GAA 864 
45 Leu Gly Thr Gly Gly Phe Gly Asn Val Cys Leu Tyr Gin His Arg Glu 

275 280 285 

^TT HAT CT'^ T^AA ATA C^^A ^^T'^ AA'~1 T^-y -^^rr rr./^ ^'^A GA^ ^'^A Af^T A'^P p 



Liys Asn Arg 
305 



31C 



^ ^ L lie ^^yo -i--/"- 



315 



320 
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Asn His Ala Asn Val Val Lys Ala Cys Asp Val Pro Glu Glu Leu Asn 

325 330 335 

ATT TTG ATT CAT GAT GTG CCT CTT CTA GCA ATG GAA TAG TGT TCT GGA 1056 
5 lie Leu He His Asp Val Pro Leu Leu Ala Met Glu Tyr Cys Ser Gly 

340 345 350 

GGA GAT CTC CGA AAG CTG CTC AAC AAA CCA GAA AAT TGT TGT GGA CTT 1104 
Gly Asp Leu Arg Lys Leu Leu Asn Lys Pro Glu Asn Cys Cys Gly Leu 
10 355 360 365 

AAA GAA AGC CAG ATA CTT TCT TTA CTA AGT GAT ATA GGG TCT GGG ATT 1152 
Lys Glu Ser Gin He Leu Ser Leu Leu Ser Asp He Gly Ser Gly He 
370 375 380 

15 

CGA TAT TTG CAT GAA AAC AAA ATT ATA CAT CGA GAT CTA AAA CCT GAA 120 0 
Arg Tyr Leu His Glu Asn Lys He He His Arg Asp Leu Lys Pro Glu 
385 390 395 400 

20 AAC ATA GTT CTT CAG GAT GTT GGT GGA AAG ATA ATA CAT AAA ATA ATT 124 8 
Asn He Val Leu Gin Asp Val Gly Gly Lys He He His Lys He He 

405 410 415 

GAT CTG GGA TAT GCC AAA GAT GTT GAT CAA GGA AGT CTG TGT ACA TCT 12 96 
25 Asp Leu Gly Tyr Ala Lys Asp Val Asp Gin Gly Ser Leu Cys Thr Ser 

420 425 430 

TTT GTG GGA ACA CTG CAG TAT CTG GCC CCA GAG CTC TTT GAG AAT AAG 1344 
Phe Val Gly Thr Leu Gin Tyr Leu Ala Pro Glu Leu Phe Glu Asn Lys 
30 435 440 445 

CCT TAC ACA GCC ACT GTT GAT TAT TGG AGC TTT GGG ACC ATG GTA TTT 13 92 

Pro Tyr Thr Ala Thr Val Asp Tyr Trp Ser Phe Gly Thr Met Val Phe 

450 455 460 

35 

GAA TGT ATT GCT GGA TAT AGG CCT TTT TTG CAT CAT CTG CAG CCA TTT 144 0 

Glu Cys He Ala Gly Tyr Arg Pro Phe Leu His His Leu Gin Pro Phe 

465 470 475 480 

40 ACC TGG CAT GAG AAG ATT AAG AAG AAG GAT CCA AAG TGT ATA TTT GCA 14 8 8 

Thr Trp His Glu Lys He Lys Lys Lys Asp Pro Lys Cys He Phe Ala 

485 490 495 

TGT GAA GAG ATG TCA GGA GAA GTT CGG TTT AGT AGC CAT TTA CCT CAA 153 6 

45 Cys Glu Glu Met Ser Gly Glu Val Arg Phe Ser Ser His Leu Pro Gin 

500 505 510 



55 



CCA AAT AGC CTT TGT AGT TTA ATA GTA GAA C^C ATG GAA AA^ 



n :.eu MeL :.eu Asn Trp Asp Pro L;.!.n Gin Arg Gly Giv^ Pro Vai Acp 



53C 535 540 
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Leu Thr Leu Lys Gin Pro Arg Cys Phe Val Leu Met Asp His lie Leu 
545 550 555 560 

AAT TTG AAG ATA GTA CAC ATC CTA AAT ATG ACT TCT GCA AAG ATA ATT 172 8 
5 Asn Leu Lys lie Val His He Leu Asn Met Thr Ser Ala Lys He He 

565 570 575 

TCT TTT CTG TTA CCA CCT GAT GAA AGT CTT CAT TCA CTA CAG TCT CGT 17 7 6 

Ser Phe Leu Leu Pro Pro Asp Glu Ser Leu His Ser Leu Gin Ser Arg 
10 580 585 590 

ATT GAG CGT GAA ACT GGA ATA AAT ACT GGT TCT CAA GAA CTT CTT TCA 1824 
He Glu Arg Glu Thr Gly He Asn Thr Gly Ser Gin Glu Leu Leu Ser 
595 600 605 

15 

GAG ACA GGA ATT TCT CTG GAT CCT CGG AAA CCA GCC TCT CAA TGT GTT 1872 
Glu Thr Gly He Ser Leu Asp Pro Arg Lys Pro Ala Ser Gin Cys Val 
610 615 620 

20 CTA GAT GGA GTT AGA GGC TGT GAT AGC TAT ATG GTT TAT TTG TTT GAT 192 0 

Leu Asp Gly Val Arg Gly Cys Asp Ser Tyr Met Val Tyr Leu Phe Asp 
625 630 635 640 

AAA AGT AAA ACT GTA TAT GAA GGG CCA TTT GCT TCC AGA AGT TTA TCT 19 6 8 

25 Lys Ser Lys Thr Val Tyr Glu Gly Pro Phe Ala Ser Arg Ser Leu Ser 

645 650 655 

GAT TGT GTA AAT TAT ATT GTA CAG GAC AGC AAA ATA CAG CTT CCA ATT 2 016 
Asp Cys Val Asn Tyr He Val Gin Asp Ser Lys He Gin Leu Pro He 
30 660 665 670 

ATA CAG CTG CGT AAA GTG TGG GCT GAA GCA GTG CAC TAT GTG TCT GGA 2 0 64 

He Gin Leu Arg Lys Val Trp Ala Glu Ala Val His Tyr Val Ser Gly 
675 680 685 

35 

CTA AAA GAA GAC TAT AGC AGG CTC TTT CAG GGA CAA AGG GCA GCA ATG 2112 

Leu Lys Glu Asp Tyr Ser Arg Leu Phe Gin Gly Gin Arg Ala Ala Met 
690 695 700 

40 TTA AGT CTT CTT AGA TAT AAT GCT AAC TTA ACA AAA ATG AAG AAC ACT 2160 
Leu Ser Leu Leu Arg Tyr Asn Ala Asn Leu Thr Lys Met Lys Asn Thr 
705 710 715 720 

TTG ATC TCA GCA TCA CAA CAA CTG AAA GCT AAA TTG GAG TTT TTT CAC 2 208 

45 Leu He Ser Ala Ser Gin Gin Leu Lys Ala Lys Leu Glu Phe Phe His 

725 730 735 

AAA AGC ATT CAG CTT GAC TTG GAG AGA TA^ AGG GA.G CA^- A'TG A^^ '"AT 
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G.i>' l^e Ser Ser Glu Lys Met Leu Lys Ala Trp l^yG Giu Met Glu Glu 
755 760 765 
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hys Ala lie His Tyr Ala Glu Val Gly Val lie Gly Tyr Leu Glu Asp 
770 775 780 

CAG ATT ATG TCT TTG CAT GOT GAA ATC ATG GGG CTA CAG AAG AGC CCC 2400 
5 Gin lie Met Ser Leu His Ala Glu He Met Gly Leu Gin Lys Ser Pro 
785 790 795 800 

TAT GGA AGA CGT CAG GGA GAC TTG ATG GAA TCT CTG GAA CAG CGT GCC 2 44 8 

Tyr Gly Arg Arg Gin Gly Asp Leu Met Glu Ser Leu Glu Gin Arg Ala 
10 805 810 815 

ATT GAT CTA TAT AAG CAG TTA AAA CAC AGA CCT TCA GAT CAC TCC TAC 2 4 96 
He Asp Leu Tyr Lys Gin Leu Lys His Arg Pro Ser Asp His Ser Tyr 

820 825 830 

15 

AGT GAC AGC ACA GAG ATG GTG AAA ATC ATT GTG CAC ACT GTG CAG AGT 2 544 
Ser Asp Ser Thr Glu Met Val Lys He He Val His Thr Val Gin Ser 
835 840 845 

20 CAG GAC CGT GTG CTC AAG GAG CTG TTT GGT CAT TTG AGC AAG TTG TTG 2 5 92 
Gin Asp Arg Val Leu Lys Glu Leu Phe Gly His Leu Ser Lys Leu Leu 
850 855 860 

GGC TGT AAG CAG AAG ATT ATT GAT CTA CTC CCT AAG GTG GAA GTG GCC 2 64 0 
25 Gly Cys Lys Gin Lys He He Asp Leu Leu Pro Lys Val Glu Val Ala 
865 870 875 880 

CTC AGT AAT ATC AAA GAA GCT GAC AAT ACT GTC ATG TTC ATG CAG GGA 2 6 88 
Leu Ser Asn He Lys Glu Ala Asp Asn Thr Val Met Phe Met Gin Gly 
30 885 890 895 

AAA AGG CAG AAA GAA ATA TGG CAT CTC CTT AAA ATT GCC TGT ACA CAG 2 73 6 

Lys Arg Gin Lys Glu He Trp His Leu Leu Lys He Ala Cys Thr Gin 

900 905 910 

35 

AGT TCT GCC CGC TCT CTT GTA GGA TCC AGT CTA GAA GGT GCA GTA ACC 2 7 84 

Ser Ser Ala Arg Ser Leu Val Gly Ser Ser Leu Glu Gly Ala Val Thr 

915 920 925 

40 CCT CAG ACA TCA GCA TGG CTG CCC CCG ACT TCA GCA GAA CAT GAT CAT 2 83 2 
Pro Gin Thr Ser Ala Trp Leu Pro Pro Thr Ser Ala Glu His Asp His 
930 935 940 

TCT CTG TCA TGT GTG GTA ACT CCT CAA GAT GGG GAG ACT TCA GCA CAA 2 880 
45 Ser Leu Ser Cys Val Val Thr Pro Gin Asp Gly Glu Thr Ser Ala Gin 
945 950 955 960 

ATG ATA GAA GAA AAT TTG AAC TGT CTT GGC CAT T^^A AGC ACT AT'^ AT'" -^Q^P 



9B0 



li": Z\y As:: Se^r Met Xet Asn Leu Asp Trp 



985 



990 



55 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 



PCT/DK98/00145 



231 



Ser Trp Leu Thr Glu 
995 



5 (2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 997 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 



Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

20 Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
25 50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 7 5 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

30 Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 

115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
35 130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 

165 170 175 

40 Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
45 210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Glv Leu Arn Spr Ara Al '^1'^ n ^r-:^- ^ co >- -^v- v,^, * ■ * ■ 



55 



2 7 ^ 2 8 C 2 B 5 

Le'^ Acp Leu Lys He Ala He Lys Ser Cys Arg Leu Glu Leu Ser Thr 
290 295 300 
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305 310 315 320 

Asn His Ala Asn Val Val Lys Ala Cys Asp Val Pro Glu Glu Leu Asn 

325 330 335 

lie Leu lie His Asp Val Pro Leu Leu Ala Met Glu Tyr Cys Ser Gly 

340 345 350 

Gly Asp Leu Arg Lys Leu Leu Asn Lys Pro Glu Asn Cys Cys Gly Leu 

355 360 365 

Lys Glu Ser Gin lie Leu Ser Leu Leu Ser Asp He Gly Ser Gly He 

370 375 380 

Arg Tyr Leu His Glu Asn Lys He He His Arg Asp Leu Lys Pro Glu 
3B5 390 395 400 

Asn He Val Leu Gin Asp Val Gly Gly Lys He He His Lys He He 

405 410 415 

Asp Leu Gly Tyr Ala Lys Asp Val Asp Gin Gly Ser Leu Cys Thr Ser 
15 420 425 430 

Phe Val Gly Thr Leu Gin Tyr Leu Ala Pro Glu Leu Phe Glu Asn Lys 

435 440 445 

Pro Tyr Thr Ala Thr Val Asp Tyr Trp Ser Phe Gly Thr Met Val Phe 
450 455 460 

20 Glu Cys He Ala Gly Tyr Arg Pro Phe Leu His His Leu Gin Pro Phe 
465 470 475 48O 

Thr Trp His Glu Lys He Lys Lys Lys Asp Pro Lys Cys He Phe Ala 

485 490 495 

Cys Glu Glu Met Ser Gly Glu Val Arg Phe Ser Ser His Leu Pro Gin 
25 500 505 510 

Pro Asn Ser Leu Cys Ser Leu He Val Glu Pro Met Glu Asn Trp Leu 

515 520 525 

Gin Leu Met Leu Asn Trp Asp Pro Gin Gin Arg Gly Gly Pro Val Asp 
530 535 540 

30 Leu Thr Leu Lys Gin Pro Arg Cys Phe Val Leu Met Asp His He Leu 
545 550 555 560 

Asn Leu Lys He Val His He Leu Asn Met Thr Ser Ala Lys He He 

565 570 575 

Ser Phe Leu Leu Pro Pro Asp Glu Ser Leu His Ser Leu Gin Ser Arg 
35 580 585 590 

He Glu Arg Glu Thr Gly He Asn Thr Gly Ser Gin Glu Leu Leu Ser 

595 600 605 

Glu Thr Gly He Ser Leu Asp Pro Arg Lys Pro Ala Ser Gin Cys Val 

610 615 620 

Leu Asp Gly Val Arg Gly Cys Asp Ser Tyr Met Val Tyr Leu Phe Asp 
625 630 635 640 

Lys Ser Lye Thr Val Tyr Glu Gly Pro Phe Ala Ser Arg Ser Leu Ser 

645 650 655 

Asp Cys Val Asn Tyr He Val Gin Asp Ser Lys He Gin Leu Pro He 
"^5 660 665 670 

He Gin Leu Arg Lys Val Trp Ala Glu Ala Val His Tyr Val Ser Gly 

675 680 685 

Leu Lys Glu Asp Tyr Ser Ara Leu Phe Gin ^Iv nin A^^ ^ ' - ^ ^ m^^ 
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''2 b 7 3 0 7 3 5 

Lys Gcr He Gin Leu Asp Leu Glu Arg Tyr Ser Glu Glr: Met: Thr Tyi 
55 740 745 75C 

J 1 \' T"^ co-f- 5^«^r ^^i; 
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755 760 765 

Lys Ala He His Tyr Ala Glu Val Gly Val He Gly Tyr Leu Glu Asp 

770 775 780 

Gin He Met Ser Leu His Ala Glu He Met Gly Leu Gin Lys Ser Pro 
5 785 790 795 800 

Tyr Gly Arg Arg Gin Gly Asp Leu Met Glu Ser Leu Glu Gin Arg Ala 

805 810 815 

He Asp Leu Tyr Lys Gin Leu Lys His Arg Pro Ser Asp His Ser Tyr 

820 825 830 

10 Ser Asp Ser Thr Glu Met Val Lys lie lie Val His Thr Val Gin Ser 

835 840 845 

Gin Asp Arg Val Leu Lys Glu Leu Phe Gly His Leu Ser Lys Leu Leu 

850 855 860 

Gly L*ys Lys Gin Lys He He Asp Leu Leu Pro Lys Val Glu Val Ala 
15 865 870 875 880 

Leu Ser Asn lie Lys Glu Ala Asp Asn Thr Val Met Phe Met Gin Gly 

885 890 895 

Lys Arg Gin Lys Glu He Trp His Leu Leu Lys He Ala Cys Thr Gin 

900 905 910 

20 Ser Ser Ala Arg Ser Leu Val Gly Ser Ser Leu Glu Gly Ala Val Thr 

915 920 925 

Pro Gin Thr Ser Ala Trp Leu Pro Pro Thr Ser Ala Glu His Asp His 

930 935 940 

Ser Leu Ser Cys Val Val Thr Pro Gin Asp Gly Glu Thr Ser Ala Gin 
25 945 950 955 960 

Met He Glu Glu Asn Leu Asn Cys Leu Gly His Leu Ser Thr He He 

965 970 975 

His Glu Ala Asn Glu Glu Gin Gly Asn Ser Met Met Asn Leu Asp Trp 

980 985 990 

30 Ser Trp Leu Thr Glu 

995 

(2) INFORMATION FOR SEQ ID NO: 12 2: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2991 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



40 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 
45 (B) LOCATION: 1...2988 

(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRTPTTON: SEQ : 2 2 



ATG CGG GAG CGG CTG GGC ACC GGC GGC TTC GGG AAC GTC TGT CTG TAG 
55 Met Arg Glu Arg Leu Gly Thr Gly Gly Phe Gly Asn Val Cys Leu Tyr 

2C ^ 
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CAG CAT CGG GAA CTT GAT CTC AAA ATA GCA ATT AAG TCT TGT CGC CTA 14 4 

Gin His Arg Glu Leu Asp Leu Lys lie Ala lie Lys Ser Cys Arg Leu 
35 40 45 

5 

GAG CTA AGT ACC AAA AAC AGA GAA CGA TGG TGC CAT GAA ATC CAG ATT 192 

Glu Leu Ser Thr Lys Asn Arg Glu Arg Trp Cys His Glu He Gin He 
50 55 60 

10 ATG AAG AAG TTG AAC CAT GCC AAT GTT GTA AAG GCC TGT GAT GTT CCT 24 0 

Met Lys Lys Leu Asn His Ala Asn Val Val Lys Ala Cys Asp Val Pro 
65 70 75 80 

GAA GAA TTG AAT ATT TTG ATT CAT GAT GTG CCT CTT CTA GCA ATG GAA 28 8 

15 Glu Glu Leu Asn He Leu He His Asp Val Pro Leu Leu Ala Met Glu 

85 90 95 

TAG TGT TCT GGA GGA GAT CTC CGA AAG CTG CTC AAC AAA CCA GAA AAT 33 6 

Tyr Cys Ser Gly Gly Asp Leu Arg Lys Leu Leu Asn Lys Pro Glu Asn 
20 100 105 110 

TGT TGT GGA CTT AAA GAA AGC CAG ATA CTT TCT TTA CTA AGT GAT ATA 3 84 

Cys Cys Gly Leu Lys Glu Ser Gin He Leu Ser Leu Leu Ser Asp He 
115 120 125 

25 

GGG TCT GGG ATT CGA TAT TTG CAT GAA AAC AAA ATT ATA CAT CGA GAT 43 2 

Gly Ser Gly Tie Arg Tyr Leu His Glu Asn Lys He He His Arg Asp 
130 135 140 

30 CTA AAA CCT GAA AAC ATA GTT CTT CAG GAT GTT GGT GGA AAG ATA ATA 480 
Leu Lys Pro Glu Asn He Val Leu Gin Asp Val Gly Gly Lys He He 
145 150 155 160 

CAT AAA ATA ATT GAT CTG GGA TAT GCC AAA GAT GTT GAT CAA GGA AGT 52 8 

35 His Lys He He Asp Leu Gly Tyr Ala Lys Asp Val Asp Gin Gly Ser 

165 170 175 

CTG TGT ACA TCT TTT GTG GGA ACA CTG CAG TAT CTG GCC CCA GAG CTC 576 
Leu Cys Thr Ser Phe Val Gly Thr Leu Gin Tyr Leu Ala Pro Glu Leu 
40 180 185 190 

TTT GAG AAT AAG CCT TAC ACA GCC ACT GTT GAT TAT TGG AGC TTT GGG 624 

Phe Glu Asn Lys Pro Tyr Thr Ala Thr Val Asp Tyr Trp Ser Phe Gly 
195 200 205 

45 

ACC ATG GTA TTT GAA TGT ATT GCT GGA TAT AGG CCT TTT TTG CAT CAT 6 72 

Thr Met Val Phe Glu Cys He Ala Gly Tyr Arg Pro Phe Leu His His 

Tin "'"■r ^ ^ 



TGT ATA l^T GCA TGT GAA GAG ATG TCA GGA GAA GTT CGG TTT AGT AGC '66 
55 Cys He Phe Ala Cys Glu Glu Met Ser Gly Glu Val Arg Phe Ser Ser 
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CAT TTA CCT CAA CCA AAT AGC CTT TGT AGT TTA ATA GTA GAA CCC ATG 816 

His Leu Pro Gin Pro Asn Ser Leu Cys Ser Leu lie Val Glu Pro Met 

260 265 270 

5 

GAA AAC TGG CTA CAG TTG ATG TTG AAT TGG GAG CCT CAG CAG AGA GGA 8 64 

Glu Asn Trp Leu Gin Leu Met Leu Asn Trp Asp Pro Gin Gin Arg Gly 

275 280 285 

10 GGA CCT GTT GAC CTT ACT TTG AAG CAG CCA AGA TGT TTT GTA TTA ATG 912 
Gly Pro Val Asp Leu Thr Leu Lys Gin Pro Arg Cys Phe Val Leu Met 
290 295 300 

GAT CaC ATT TTG AAT TTG AAG ATA GTA CAC ATC CTA AAT ATG ACT TCT 96 0 

15 Asp His He Leu Asn Leu Lys He Val His He Leu Asn Met Thr Ser 
305 310 315 320 

GCA AAG ATA ATT TCT TTT CTG TTA CCA CCT GAT GAA AGT CTT CAT TCA 10 08 
Ala Lys He He Ser Phe Leu Leu Pro Pro Asp Glu Ser Leu His Ser 
20 325 330 335 

CTA CAG TCT CGT ATT GAG CGT GAA ACT GGA ATA AAT ACT GGT TCT CAA 1056 

Leu Gin Ser Arg He Glu Arg Glu Thr Gly He Asn Thr Gly Ser Gin 

340 345 350 

25 

GAA CTT CTT TCA GAG ACA GGA ATT TCT CTG GAT CCT CGG AAA CCA GCC 1104 

Glu Leu Leu Ser Glu Thr Gly He Ser Leu Asp Pro Arq Lys Pro Ala 

355 360 365 

30 TCT CAA TGT GTT CTA GAT GGA GTT AGA GGC TGT GAT AGC TAT ATG GTT 1152 
Ser Gin Cya Val Leu Asp Gly Val Arg Gly Cys Asp Ser Tyr Met Val 
370 375 380 



35 



45 



TAT TTG TTT GAT AAA AGT AAA ACT GTA TAT GAA GGG CCA TTT GCT TCC 12 0 0 

Tyr Leu Phe Asp Lys Ser Lys Thr Val Tyr Glu Gly Pro Phe Ala Ser 
385 390 395 400 



AGA AGT TTA TCT GAT TGT GTA AAT TAT ATT GTA CAG GAC AGC AAA ATA 12 4 8 

Arg Ser Leu Ser Asp Cys Val Asn Tyr He Val Gin Asp Ser Lys He 
40 405 410 415 

CAG CTT CCA ATT ATA CAG CTG CGT AAA GTG TGG GCT GAA GCA GTG CAC 12 96 

Gin Leu Pro He He Gin Leu Arg Lys Val Trp Ala Glu Ala Val His 

420 425 430 



TAT GTG TCT GGA CTA AAA GAA GAC TAT AGC AGG CTC TTT CAG GGA CAA 13 44 

Tyr Val Ser Gly Leu Lys Glu Asp Tyr Ser Arg Leu Phe Gin Gly Gin 
435 440 445 



ATG AAG AAC ACT TTG ATC TCA GCA TCA CAA CAA CTG A/vA GCT AAA TTG :4 4C 

55 Met Lys Asn Thr Leu lie Ser Ala Ser Gin Gin Leu Lys Ala Lys Leu 
4 6 5 4 •: ^ - : - ' 
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GAG TTT TTT CAC AAA AGC ATT CAG CTT GAC TTG GAG AGA TAG AGC GAG 14 8 8 

Glu Phe Phe His Lys Ser He Gin Leu Asp Leu Glu Arg Tyr Ser Glu 

485 490 495 

5 

CAG ATG ACG TAT GGG ATA TCT TCA GAA AAA ATG CTA AAA GCA TGG AAA 15 3 6 

Gin Met Thr Tyr Gly He Ser Ser Glu Lys Met Leu Lys Ala Trp Lys 

500 505 510 

10 GAA ATG GAA GAA AAG GCC ATC CAC TAT GCT GAG GTT GGT GTC ATT GGA 15 84 

Glu Met Glu Glu Lys Ala He His Tyr Ala Glu Val Gly Val lie Gly 
515 520 525 

TAC CTG GAG GAT CAG ATT ATG TCT TTG CAT GCT GAA ATC ATG GGG CTA 163 2 

15 Tyr Leu Glu Asp Gin He Met Ser Leu His Ala Glu He Met Gly Leu 
530 535 540 

CAG AAG AGC CCC TAT GGA AGA CGT CAG GGA GAC TTG ATG GAA TCT CTG 16 80 

Gin Lys Ser Pro Tyr Gly Arg Arg Gin Gly Asp Leu Met Glu Ser Leu 

20 545 550 555 560 

GAA CAG CGT GCC ATT GAT CTA TAT AAG CAG TTA AAA CAC AGA CCT TCA 17 2 8 

Glu Gin Arg Ala He Asp Leu Tyr Lys Gin Leu Lys His Arg Pro Ser 

565 570 575 

25 

GAT CAC TCC TAC AGT GAC AGC ACA GAG ATG GTG AAA ATC ATT GTG CAC 177 6 

Asp His Ser Tyr Ser A.sp Ser Thr Glu Met Val Lys He He Val His 

580 585 590 

30 ACT GTG CAG AGT CAG GAC CGT GTG CTC AAG GAG CTG TTT GGT CAT TTG 1824 

Thr Val Gin Ser Gin Asp Arg Val Leu Lys Glu Leu Phe Gly His Leu 
595 600 605 

AGC AAG TTG TTG GGC TGT AAG CAG AAG ATT ATT GAT CTA CTC CCT AAG 1872 

35 Ser Lys Leu Leu Gly Cys Lys Gin Lys He He Asp Leu Leu Pro Lys 
610 615 620 

GTG GAA GTG GCC CTC AGT AAT ATC AAA GAA GCT GAC AAT ACT GTC ATG 192 0 

Val Glu Val Ala Leu Ser Asn He Lys Glu Ala Asp Asn Thr Val Met 

40 625 630 635 640 

TTC ATG CAG GGA AAA AGG CAG AAA GAA ATA TGG CAT CTC CTT AAA ATT 196 8 

Phe Met Gin Gly Lys Arg Gin Lys Glu He Trp His Leu Leu Lys He 

645 650 655 
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GCC TGT ACA CAG AGT TCT GCC CGC TCT CTT GTA GGA TCC AGT CTA GAA 2 016 
Ala Cys Thr Gin Ser Ser Ala Arg Ser Leu Val Gly Ser Ser Leu Glu 



GAA CAT GAT CAT TCT CTG TCA TGT GTG GTA ACT CCT CAA GAT GGG GAG 2 112 

55 Glu His Asp His Ser Leu Ser Cys Val Val Thr Pro Gin Asp Gly Glu 
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ACT TCA GCA CAA ATG ATA GAA GAA AAT TTG AAC TGC CTT GGC CAT TTA 216 0 

Thr Ser Ala Gin Met lie Glu Glu Asn Leu Asn Cys Leu Gly His Leu 
705 710 715 720 

5 

AGC ACT ATT ATT CAT GAG GCA AAT GAG GAA CAG GGC AAT AGT ATG ATG 220 8 

Ser Thr lie He His Glu Ala Asn Glu Glu Gin Gly Asn Ser Met Met 

725 730 735 

10 AAT CTT GAT TGG AGT TGG TTA ACA GAA TGG GTA CCG CGG GCC CGG GAT 225 6 

Asn Leu Asp Trp Ser Trp Leu Thr Glu Trp Val Pro Arg Ala Arg Asp 

740 745 750 

CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG 23 04 

15 Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 

755 760 765 

GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG 2 3 52 
Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 
20 770 775 780 

TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG 2 40 0 

Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 

785 790 795 800 

25 

ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC 244 8 

Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 

805 810 815 

30 ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC 2 4 96 
Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr 

820 825 830 

CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA 2 54 4 
35 Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 

835 840 845 

GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC 2 5 92 
Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr 
40 850 855 860 

AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC 2 64 0 
Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 
865 870 875 880 



45 



ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG 2 68 8 
He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly 

885 890 R9S 



GAC AAG CAG AAG /vAC GGC ATC AAG GTG ^vAC TTC AAG ATC CGC CAC AAC 2 7 84 

55 Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn 

915 9 2 0 '^7 
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ATC GAG 
He Glu 
930 

5 

CCC ATC 
Pro He 
945 

10 ACC CAG 
Thr Gin 



15 Val Leu 



GAG CTG 
Glu Leu 

20 



GAC GGC AGC GTG 
Asp Gly Ser Val 

GGC GAC GGC CCC 
Gly Asp Gly Pro 

950 

TCC GCC CTG AGC 
Ser Ala Leu Ser 
965 

CTG GAU TTC GTG 
Leu Glu Phe Val 
980 

TAC AAG TAA 
Tyr Lys 
995 



CAG CTC GCC GAC 
Gin Leu Ala Asp 
935 

GTG CTG CTG CCC 
Val Leu Leu Pro 



AAA GAC CCC AAC 
Lys Asp Pro Asn 

970 

ACC GCC GCC GGG 
Thr Ala Ala Gly 
985 



CAC TAC CAG CAG 
His Tyr Gin Gin 
940 

GAC AAC CAC TAC 
Asp Asn His Tyr 
955 

GAG AAG CGC GAT 
Glu Lys Arg Asp 

ATC ACT CTC GGC 
He Thr Leu Gly 

990 



AAC ACC 2 832 
Asn Thr 



CTG AGC 2 880 
Leu Ser 
960 

CAC ATG 2 92 8 

His Met 

975 

ATG GAC 2 976 
Met Asp 

2991 



(2) INFORMATION FOR SEQ ID NO: 123: 

25 (i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 996 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



35 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 



Met Glu Arg Pro Pro Gly Leu Arg Pro Gly Ala Gly Gly Pro Trp Glu 

15 10 15 

Met Arg Glu Arg Leu Gly Thr Gly Gly Phe Gly Asn Val Cys Leu Tyr 

20 25 30 

40 Gin His Arg Glu Leu Asp Leu Lys He Ala He Lys Ser Cys Arg Leu 

35 40 45 

Glu Leu Ser Thr Lys Asn Arg Glu Arg Trp Cys His Glu He Gin He 

50 55 60 

Met Lys Lys Leu Asn His Ala Asn Val Val Lys Ala Cys Asp Val Pro 

45 65 70 75 80 

Glu Glu Leu Asn He Leu He His Asp Val Pro Leu Leu Ala Met Glu 

85 90 95 

Tyr Cys Ser Gly Gly Asp Leu Ara Lvs Len Lei: Asr. Lvn Pr'- Ar^ 



55 



Leu 
145 



1. J. ^ 



^ J J 



14 C 



Pro Glu Asn He 

15 0 

lie Tie A r-^ p I . ^ : 



v'al Leu Glr. Asp Val Gly Gly Lys He He 



155 



15 0 
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Leu Cys Thr Ser 

180 

Phe Glu Asn Lys 
195 

Thr Met Val Phe 
210 

Leu Gin Pro Phe 
225 

Cys He Phe Ala 

His Leu Pro Gin 

260 

Glu Asn Trp Leu 
275 

Gly Pro Val Asp 
290 

Asp His He Leu 
305 

Ala Lys He He 

Leu Gin Ser Arg 

340 

Glu Leu Leu Ser 
355 

Ser Gin Cys Val 
370 

Tyr Leu Phe Asp 
385 

Arg Ser Leu Ser 

Gin Leu Pro He 

420 

Tyr Val Ser Gly 
435 

Arg Ala Ala Met 
450 

Met Lys Asn Thr 
465 

Glu Phe Phe His 

Gin Met Thr Tyr 

500 

Glu Met Glu Glu 
515 

Tyr Leu Glu Asp 
530 

Gin Lvs Ser Pro 



165 

Phe Val Gly Thr 

Pro Tyr Thr Ala 

200 

Glu Cys He Ala 
215 

Thr Trp His Glu 
230 

Cys Glu Glu Met 
245 

Pro Asn Ser Leu 

Gin Leu Met Leu 

280 

Leu Thr Leu Lys 
295 

Asn Leu Lys He 
310 

Ser Phe Leu Leu 
325 

He Glu Arg Glu 

Glu Thr Gly He 

360 

Leu Asp Gly Val 
375 

Lys Ser Lys Thr 
390 

Asp Cys Val Asn 
405 

He Gin Leu Arg 

Leu Lys Glu Asp 

440 

Leu Ser Leu Leu 
455 

Leu He Ser Ala 
470 

Lys Ser He Gin 
485 

Gly He Ser Ser 

Lys Ala He His 

520 

Gin He Met Ser 
535 

Tvr Glv Arrr A'^n 



239 

170 

Leu Gin Tyr Leu 
185 

Thr Val Asp Tyr 

Gly Tyr Arg Pro 

220 

Lys He Lys Lys 
235 

Ser Gly Glu Val 
250 

Cys Ser Leu He 
265 

Asn Trp Asp Pro 

Gin Pro Arg Cys 

300 

Val His He Leu 
315 

Pro Pro Asp Glu 
330 

Thr Gly He Asn 
345 

Ser Leu Asp Pro 

Arg Gly Cys Asp 

Ton 

^ w 

Val Tyr Glu Gly 
395 

Tyr He Val Gin 
410 

Lys Val Trp Ala 
425 

Tyr Ser Arg Leu 

Arg Tyr Asn Ala 

460 

Ser Gin Gin Leu 
475 

Leu Asp Leu Glu 
490 

Glu Lys Met Leu 
505 

Tyr Ala Glu Val 

Leu His Ala Glu 

540 

r!i-- r;^-. Ar^^ 



175 

Ala Pro Glu Leu 
190 

Trp Ser Phe Gly 
205 

Phe Leu His His 

Lys Asp Pro Lys 

240 

Arg Phe Ser Ser 
255 

Val Glu Pro Met 
270 

Gin Gin Arg Gly 
285 

Phe Val Leu Met 

Asn Met Thr Ser 

320 

Ser Leu His Ser 
335 

Thr Gly Ser Gin 
350 

Arg Lys Pro Ala 
365 

Ser Tyr Met Val 

Pro Phe Ala Ser 

400 

Asp Ser Lys He 
415 

Glu Ala Val His 
430 

Phe Gin Gly Gin 
445 

Asn Leu Thr Lys 

Lys Ala Lys Leu 

480 

Arg Tyr Ser Glu 
495 

Lys Ala Trp Lys 
510 

Gly Val He Gly 
525 

He Met Gly Leu 



580 565 59 0 

Thr Val Gin Ser Gin Asp Arg Val Leu Lys Glu Leu Phe Gly His Leu 
595 600 605 
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610 615 620 

Val Glu Val Ala Leu Ser Asn lie Lys Glu Ala Asp Asn Thr Val Met 
625 630 635 640 

Phe Met Gin Gly Lys Arg Gin Lys Glu lie Trp His Leu Leu Lys lie 
5 645 650 655 

Ala Cys Thr Gin Ser Ser Ala Arg Ser Leu Val Gly Ser Ser Leu Glu 

660 665 670 

Gly Ala Val Thr Pro Gin Thr Ser Ala Trp Leu Pro Pro Thr Ser Ala 
675 680 685 

10 Glu His Asp His Ser Leu Ser Cys Val Val Thr Pro Gin Asp Gly Glu 
690 695 700 

Thr Ser Ala Gin Met lie Glu Glu Asn Leu Asn Cys Leu Gly His Leu 
705 710 715 720 

Ser Thr lie He His Glu Ala Asn Glu Glu Gin Gly Asn Ser Met Met 
15 725 730 735 

Asn Leu Asp Trp Ser Trp Leu Thr Glu Trp Val Pro Arg Ala Arg Asp 

740 745 750 

Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 
755 760 765 

20 Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 
770 775 780 

Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
785 790 795 800 

Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 
25 805 810 815 

Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr 

820 825 830 

Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 
835 840 845 

30 Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr 
850 855 860 

Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 
865 870 875 880 

He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly 
35 885 890 895 

His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala 

900 905 910 

Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn 
915 920 925 

40 He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 
930 935 940 

Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 
945 950 955 960 

Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
45 965 970 975 

Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp 

980 985 990 

Glu Leu Tyr Lys 



: . SEQUENCE CiiAKACTERISTICS : 
;a) LENGTH: 1908 base pairs 
55 (B) TYPE: nucleic acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

5 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1 . . . 1905 
(D) OTHER INFORMATION: 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 4: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 

15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

20 GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
25 Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 24 0 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
30 65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 288 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

35 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

40 GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3B4 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 4 32 

45 He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAT AAO CA^- AAn ^A^ " " 



55 



..e /^.ra His Asn 1 xe G.u Ano G^'/ Ser 



165 

r^'P^ ^^(" r^TT^ r*^'~'r^ ^T^r^ 



17C 



175 
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Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

CCC GTG CTG CTG CCC GAG AAC CAC TAG CTG AGC ACC CAG TCC GCC CTG 62 4 

5 Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

AGC AAA GAG CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 67 2 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
10 210 215 220 

GTG ACC GCC GCC GGG ATG ACT CTG GGC ATG GAC GAG CTG TAG AAG TCC 72 0 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 

225 230 235 240 

15 

GGA CTC AGA TCT CGA GCT CAA GCT TCC ATG AGC GAG ACG GTC ATG ATG 7 68 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Met Ser Glu Thr Val He Met 

245 250 255 

20 AGC GAG ACG GTC ATC TGT TCC AGC CGG GCC ACT GTG ATG CTT TAT GAT 816 

Ser Glu Thr Val lie Cys Ser Ser Arg Ala Thr Val Met Leu Tyr Asp 

260 265 270 

GAT GGC AAC AAG CGA TGG CTC CCT GCT GGC ACG GGT CCC CAG GCC TTC 864 
25 Asp Gly Asn Lys Arg Trp Leu Pro Ala Gly Thr Gly Pro Gin Ala Phe 

275 280 285 

AGC CGC GTC CAG ATC TAG CAC AAC CCC ACG GCC AAT TCC TTT CGC GTC 912 
Ser Arg Val Gin He Tyr His Asn Pro Thr Ala Asn Ser Phe Arg Val 
30 290 295 300 

GTG GGC CGG AAG ATG CAG CCC GAC CAG CAG GTG GTC ATC AAC TGT GCC 96 0 

Val Gly Arg Lys Met Gin Pro Asp Gin Gin Val Val He Asn Cys Ala 

305 310 315 320 

35 

ATC GTC CGG GGT GTG AAG TAT AAC CAG GCC ACC CCC AAC TTC CAT CAG 100 8 

He Val Arg Gly Val Lys Tyr Asn Gin Ala Thr Pro Asn Phe His Gin 

325 330 335 

40 TGG CGC GAC GCT CGC CAG GTC TGG GGC CTC AAC TTC GGC AGC AAG GAG 105 6 

Trp Arg Asp Ala Arg Gin Val Trp Gly Leu Asn Phe Gly Ser Lys Glu 

340 345 350 

GAT GCG GCC CAG TTT GCC GCC GGC ATG GCC AGT GCC CTA GAG GCG TTG 1104 
45 Asp Ala Ala Gin Phe Ala Ala Gly Met Ala Ser Ala Leu Glu Ala Leu 

355 360 365 

GAA GGA GGT GGG CCC CCT CCA CCC CCA GCA CTT CCC ACC TGG TCG GTC ^. 1^? 



r- r ^ . u 



A r q 'J 



55 



385 



39C 



3 95 



400 



CCC GGC CCG T: 



CAC 



'A 



-3 1 \ 
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Pro Gly Pro Ser Glu His lie Glu Arg Arg Val Ser Asn Ala Gly Gly 

405 410 415 

CCA CCT GCT CCC CCC OCT GGG GGT CCA CCC CCA CCA CCA GGA CCT CCC 12 96 

5 Pro Pro Ala Pro Pro Ala Gly Gly Pro Pro Pro Pro Pro Gly Pro Pro 

420 425 430 

CCT CCT CCA GGT CCC CCC CCA CCC CCA GGT TTG CCC CCT TCG GGG GTC 13 44 

Pro Pro Pro Gly Pro Pro Pro Pro Pro Gly Leu Pro Pro Ser Gly Val 
10 435 440 445 

CCA GCT GCA GCG CAC GGA GCA GGG GGA GGA CCA CCC CCT GCA CCC CCT 13 92 

Pro Ala Ala Ala His Gly Ala Gly Gly niy Pro Pro Pro Ala Pro Pro 
450 455 460 

15 

CTC CCG GCA GCA CAG GGC CCT GGT GGT GGG GGA GCT GGG GCC CCA GGC 1440 

Leu Pro Ala Ala Gin Gly Pro Gly Gly Gly Gly Ala Gly Ala Pro Gly 
465 470 475 480 

20 CTG GCC GCA GCT ATT GCT GGA GCC AAA CTC AGG AAA GTC AGC AAG CAG 14 8 8 

Leu Ala Ala Ala lie Ala Gly Ala Lys Leu Arg Lys Val Ser Lys Gin 

485 490 495 

GAG GAG GCC TCA GGG GGG CCC ACA GCC CCC AAA GCT GAG AGT GGT CGA 153 6 

25 Glu Glu Ala Ser Gly Gly Pro Thr Ala Pro Lys Ala Glu Ser Gly Arg 

500 505 510 

AGC GGA GGT GGG GGA CTC ATG GAA GAG ATG AAC GCC ATG CTG GCC CGG 15 84 

Ser Gly Gly Gly Gly Leu Met Glu Glu Met Asn Ala Met Leu Ala Arg 
30 515 520 525 

AGA AGG AAA GCC ACG CAA GTT GGG GAG AAA ACC CCC AAG GAT GAA TCT 1632 

Arg Arg Lys Ala Thr Gin Val Gly Glu Lys Thr Pro Lys Asp Glu Ser 
530 535 540 

35 

GCC AAT CAG GAG GAG CCA GAG GCC AGA GTC CCG GCC CAG AGT GAA TCT 16 80 

Ala Asn Gin Glu Glu Pro Glu Ala Arg Val Pro Ala Gin Ser Glu Ser 

545 550 555 560 

40 GTG CGG AGA CCC TGG GAG AAG AAC AGC ACA ACC TTG CCA AGG ATG AAG 172 8 

Val Arg Arg Pro Trp Glu Lys Asn Ser Thr Thr Leu Pro Arg Met Lys 

565 570 575 

TCG TCT TCT TCG GTG ACC ACT TCC GAG ACC CAA CCC TGC ACG CCC AGC 177 6 
45 Ser Ser Ser Ser Val Thr Thr Ser Glu Thr Gin Pro Cys Thr Pro Ser 

580 585 590 



A^'^ HAT ""A^ 



55 



610 



615 



'S Va: Lys Gl'^ Giu lie lie Glu Al 



620 
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Phe Val Gin Glu Leu Arg Lys Arg Gly Ser Pro 
625 630 635 



5 (2) INFORMATION FOR SEQ ID NO: 12 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 635 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 



Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

20 Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
25 50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

30 Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 

115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
35 130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 

165 170 175 

40 Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
45 210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

^Av Le)i Ara Ser Ara AT ^ Gin A1 r M*^*" ^o>- rTi: Th^ Vi ^ T "! Me^ 



55 



27 



er Arg Val Gin He 
290 



280 2BB 
'yr Hi3 Agh Pro Thr Ala Acn Ser Phe Arg Val 



295 



300 
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305 
lie 

Trp 

5 

Asp 

Glu 

10 Pro 
385 
Pro 

Pro 

15 

Pro 
Pro 

20 Leu 
465 
Leu 

Glu 

25 

Ser 

Arg 

30 Ala 
545 

Val 

Ser 

35 

Ser 
Glu 

40 Phe 

625 



Val Arg Gly 

Arg Asp Ala 
340 

Ala Ala Gin 
355 

Gly Gly Gly 
370 

Asn Gly Pro 

Gly Pro Ser 

Pro Ala Pro 
420 

Pro Pro Gly 
435 

Ala Ala Ala 
450 

Pro Ala Ala 

Ala Ala Ala 

Glu Ala Ser 
500 

Gly Gly Gly 
515 

Arg Lys Ala 
530 

Asn Gin Glu 

Arg Arg Pro 

Ser Ser Ser 
580 

Ser Asp Tyr 

595 

Val Lys Lys 
610 

Val Gin Glu 



310 

Val Lys Tyr 
325 

Arg Gin Val 

Phe Ala Ala 

Pro Pro Pro 
375 

Ser Pro Glu 
390 

Glu His He 
405 

Pro Ala Gly 

Pro Pro Pro 

His Gly Ala 
455 

Gin Gly Pro 

470 
He Ala Gly 
485 

Gly Gly Pro 

Gly Leu Met 

Thr Gin Val 
535 

Glu Pro Glu 
550 

Trp Glu Lys 
565 

Val Thr Thr 

Ser Asp Leu 

Glu Leu Gin 
615 

Leu Arg Lys 
630 



245 

Asn Gin Ala 
330 

Trp Gly Leu 
345 

Gly Met Ala 
360 

Pro Pro Ala 

Glu Val Glu 

Glu Arg Arg 
410 

Gly Pro Pro 

425 
Pro Pro Gly 
440 

Gly Gly Gly 

Gly Gly Gly 

Ala Lys Leu 
490 

Thr Ala Pro 
505 

Glu Glu Met 
520 

Gly Glu Lys 

Ala Arg Val 

Asn Ser Thr 
570 

Ser Glu Thr 
585 

Gin Arg Val 
600 

Lys Val Lys 
Arg Gly Ser 



315 

Thr Pro Asn 

Asn Phe Gly 

Ser Ala Leu 
365 

Leu Pro Thr 
380 

Gin Gin Lys 
395 

Val Ser Asn 

Pro Pro Pro 

Leu Pro Pro 
445 

Pro Pro Pro 
460 

Gly Ala Gly 
475 

Arg Lys Val 

Lys Ala Glu 

Asn Ala Met 
525 

Thr Pro Lys 
540 

Pro Ala Gin 
555 

Thr Leu Pro 

Gin Pro Cys 

Lys Gin Glu 
605 

Glu Glu He 

620 

Pro 
635 



320 

Phe His Gin 

335 
Ser Lys Glu 
350 

Glu Ala Leu 

Trp Ser Val 

Arg Gin Gin 
400 

Ala Gly Gly 

415 
Gly Pro Pro 
430 

Ser Gly Val 

Ala Pro Pro 

Ala Pro Gly 
480 

Ser Lys Gin 
495 

Ser Gly Arg 
510 

Leu Ala Arg 

Asp Glu Ser 

Ser Glu Ser 
560 

Arg Met Lys 
575 

Thr Pro Ser 
590 

Leu Leu Glu 

lie Glu Ala 



(2) INFORMATION FOR SEQ ID NO: 12 6: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1329 base pairs 

(B) TYPE: nucleic acid 
(O 5TRANDEDNESS : ni^nl'- 



(A) NAME/KEY: Coding Sequence 
55 (B) LOCATION: 1...1326 
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{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

5 Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 9 6 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
10 20 25 30 



15 



GAG GGC GAG GGC GAT GCC ACC TAG GGC AAG CTG ACC CTG AAG TTC ATC 14 4 

Glu Gly Glu Gly Acp Ala Thr Tyr Gly Lye Leu Thr Leu Lys Phe lie 
35 40 45 



TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

20 CTG ACC TAC GGC GTG GAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 24 0 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 288 
25 Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

B5 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
30 100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

35 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

40 AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 4 80 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

45 Gly He Lys Val Asn Phe Lys lie Arg His Asn He Glu Asp Gly Ser 

165 170 175 



55 



Pre Val Leu Leu Pro Asp 
195 



AE3n His 
20C 



■r Leu 



Ser 



Gin 
2 05 



er Ala L e 
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Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAG GAG CTG TAG AAG TCC 72 0 

5 Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCT CAA GCT TCA ATG GCT GCC ATC CGG AAG AAA 7 68 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Met Ala Ala lie Arg Lys Lys 
10 245 250 255 

CTG GTG ATT GTT GGT GAT GGA GCC TGT GGA AAG ACA TGC TTG CTC ATA 816 

Leu Val lie Val Gly Asp Gly Ala Cys Gly Lys Thr Cys Leu Leu He 

260 265 270 

15 

GTC TTC AGC AAG GAC CAG TTC CCA GAG GTG TAT GTG CCC ACA GTG TTT 864 

Val Phe Ser Lys Asp Gin Phe Pro Glu Val Tyr Val Pro Thr Val Phe 
275 280 285 

20 GAG AAC TAT GTG GCA GAT ATC GAG GTG GAT GGA AAG CAG GTA GAG TTG 912 
Glu Asn Tyr Val Ala Asp He Glu Val Asp Gly Lys Gin Val Glu Leu 
290 295 300 

GCT TTG TGG GAC ACA GCT GGG CAG GAA GAT TAT GAT CGC CTG AGG CCC 960 
25 Ala Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro 
305 310 315 320 

CTC TCC TAG CCA GAT ACC GAT GTT ATA CTG ATG TGT TTT TCC ATC GAC 100 6 
Leu Ser Tyr Pro Asp Thr Asp Val He Leu Met Cys Phe Ser He Asp 
30 325 330 335 

AGC CCT GAT AGT TTA GAA AAC ATC CCA GAA AAG TGG ACC CCA GAA GTC 105 6 

Ser Pro Asp Ser Leu Glu Asn He Pro Glu Lys Trp Thr Pro Glu Val 

340 345 350 

35 

AAG CAT TTC TGT CCC AAC GTG CCC ATC ATC CTG GTT GGG AAT AAG AAG 1104 

Lys His Phe Cys Pro Asn Val Pro He He Leu Val Gly Asn Lys Lys 
355 360 365 

40 GAT CTT CGG AAT GAT GAG CAC ACA AGG CGG GAG CTA GCC AAG ATG AAG 115 2 

Asp Leu Arg Asn Asp Glu His Thr Arg Arg Glu Leu Ala Lys Met Lys 
370 375 380 

CAG GAG CCG GTG AAA CCT GAA GAA GGC AGA GAT ATG GCA AAC AGG ATT 12 00 

45 Gin Glu Pro Val Lys Pro Glu Glu Gly Arg Asp Met Ala Asn Arg He 
385 390 395 400 

GGC GCT TTT GGG TAC ATG GAG TGT TCA GCA AAG ACC AAA GAT GGA GT^^ "?4« 



55 



4 2 0 



425 



eu G.:: /\^a /^rg /-^r^^ 
43 0 
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Gly Lys Lys Lys Ser Gly Cys Leu Val Leu 
435 440 



5 (2) INFORMATION FOR SEQ ID NO: 127 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 442 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 



Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

20 Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
25 50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 eo 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

65 90 95 

30 Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
35 130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

40 Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
45 210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

r;lv Arn Arc AV"? '^"'n A'' - ."^ov- Mnh n n o A'^'n \' 



275 28C 2Bb 

Glu Asn Tyr Val Ala Asp He Glu Val Asp Gly Lys Gin Val Glu Leu 
55 290 295 300 
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305 310 315 320 

Leu Ser Tyr Pro Asp Thr Asp Val lie Leu Met Cys Phe Ser lie Asp 

325 330 335 

Ser Pro Asp Ser Leu Glu Asn lie Pro Glu Lys Trp Thr Pro Glu Val 
5 340 345 350 

Lys His Phe Cys Pro Asn Val Pro lie lie Leu Val Gly Asn Lys Lys 

355 360 365 

Asp Leu Arg Asn Asp Glu His Thr Arg Arg Glu Leu Ala Lys Met Lys 
370 375 380 

10 Gin Glu Pro Val Lys Pro Glu Glu Gly Arg Asp Met Ala Asn Arg lie 
385 390 395 400 

Gly Ala Phe Gly Tyr Met Glu Cys Ser Ala Lys Thr Lys Asp Gly Val 

405 410 415 

Arg Glu Val Phe Glu Met Ala Thr Arg Ala Ala Leu Gin Ala Arg Arg 
15 420 425 430 

Gly Lys Lys Lys Ser Gly Cys Leu Val Leu 
435 440 



20 



35 



(2) INFORMATION FOR SEQ ID NO: 12 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

30 (A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1137 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 8: 

ATG GAC CAT TAT GAT TCT CAG CAA ACC AAC GAT TAC ATG CAG CCA GAA 4 8 

Met Asp His Tyr Asp Ser Gin Gin Thr Asn Asp Tyr Met Gin Pro Glu 
15 10 15 

40 GAG GAC TGG GAC CGG GAC CTG CTC CTG GAC CCG GCC TGG GAG AAG CAG 9 6 

Glu Asp Trp Asp Arg Asp Leu Leu Leu Asp Pro Ala Trp Glu Lys Gin 

20 25 30 

CAG AGA AAG ACA TTC ACG GCA TGG TGT AAC TCC CAC CTC CGG AAG GCG 14 4 

45 Gin Arg Lys Thr Phe Thr Ala Trp Cys Asn Ser His Leu Arg Lys Ala 

35 40 45 

'^GG ACA CAH T\'^C GAC AA^ AT'^ HAA ^-AC r;?." -nrr-^ '^n^ n.A"" r-T-^^ A A" ' ^" 



Leu Met Lc-j Leu Leu Glu Vai lie Ser Gly Giu Arg Leu A^a Lys Pr^ 
65 7C 75 80 



55 



SUBSTITUTE SHEET (RULE 26) 



wo 98/45704 



PCT/DK98/00145 



250 

Glu Arg Gly Lys Met Arg Val His Lys He Ser Asn Val Asn Lys Ala 

85 90 95 

CTG GAT TTC ATA GCC AGC AAA GGC GTC AAA CTG GTG TCC ATC GGA GCC 33 6 

5 Leu Asp Phe He Ala Ser Lys Gly Val Lys Leu Val Ser lie Gly Ala 

100 105 110 

GAA GAA ATC GTG GAT GGG AAT GTG AAG ATG ACC CTG GGC ATG ATC TGG 3 84 

Glu Glu He Val Asp Gly Asn Val Lys Met Thr Leu Gly Met He Trp 
10 115 120 125 

ACC ATC ATC CTG CGC AGG GAT CCA CCG GTC GCC ACC ATG GTG AGC AAG 43 2 

Thr He He Leu Arg Arg Asp Pro Pro Val Ala Thr Met Val Ser Lys 

130 135 140 

15 

GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC GAG CTG GAC 480 

Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu Asp 

145 150 155 160 

20 GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC 52 8 

Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly 

165 170 175 

GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC 576 
25 Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly 

180 185 190 

AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG ACC TAC GGC 624 
Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly 
30 195 200 205 

GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC 672 

Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe 

210 215 220 

35 

TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC 72 0 

Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe 

225 230 235 240 

40 TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG 768 
Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu 

245 250 255 



45 



GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG 816 

Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp Phe Lys 

260 265 270 

GAG GAC GGC AAC ATC CT^ CiC^I AA^- ^T^^ n.r,^ ""A'" AA^ '^A^ AA^ AC-^ 



55 



:^ 90 



Xe*: Ala Asp Ly 
295 



G Gin Lys Asn Gly I i e 
300 



s Val 
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Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser Val Gin Leu Ala 
305 310 315 320 

GAC CAC TAG GAG GAG AAC ACC CCG ATC GGC GAG GGC CCG GTG CTG CTG 10 0 8 
5 Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro Val Leu Leu 

325 330 335 

CCC GAC AAC CAC TAG CTG AGC ACC GAG TCC GCC CTG AGC AAA GAC CCC 105 6 

Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro 
10 340 345 350 

AAC GAG AAG GGC GAT CAC ATG GTG CTG CTG GAG TTC GTG ACC GCC GCC 1104 
Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala 
355 360 365 



15 



20 



GGG ATC ACT CTC GGC ATG GAC GAG CTG TAG AAG TAA 114 0 

Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
370 375 



(2) INFORMATION FOR SEQ ID NO: 12 9: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 79 amino acids 

25 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
30 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 9: 

Met Asp His Tyr Asp Ser Gin Gin Thr Asn Asp Tyr Met Gin Pro Glu 
35 1 5 10 15 

Glu Asp Trp Asp Arg Asp Leu Leu Leu Asp Pro Ala Trp Glu Lys Gin 

20 25 30 

Gin Arg Lys Thr Phe Thr Ala Trp Cys Asn Ser His Leu Arg Lys Ala 
35 40 45 

40 Gly Thr Gin lie Glu Asn lie Glu Glu Asp Phe Arg Asp Gly Leu Lys 
50 55 60 

Leu Met Leu Leu Leu Glu Val lie Ser Gly Glu Arg Leu Ala Lys Pro 

65 70 75 80 

Glu Arg Gly Lys Met Arg Val His Lys lie Ser Asn Val Asn Lys Ala 

45 85 90 95 

Leu Asp Phe lie Ala Ser Lys Gly Val Lys Leu Val Ser lie Gly Ala 

100 105 110 

Glu Glu Tie Val Asn Glv Asn Val Tvq v*^*- ^. ^-V/ ^^r 



- 4 

Gly Asp Val As: 



55 



15 0 
Gly His 
165 



15 5 16 0 

-ys Phe Scr Val Ser Giy Glu Gly Glu Gly 

170 175 



t \ ,. '. i . . . 
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180 IBS 190 

Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly 

195 200 205 

Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe 

210 215 220 

Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe 
225 230 235 240 

Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu 

245 250 255 

Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie Asp Phe Lys 

260 265 270 

Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser 

275 ?eo 

His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly lie Lys Val 

290 295 300 

Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser Val Gin Leu Ala 
305 310 315 320 

Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro Val Leu Leu 

325 330 335 

Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro 

340 345 350 

Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala 

355 360 365 

Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
370 375 

{2) INFORMATION FOR SEQ ID NO: 13 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1 . , . 3513 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 0: 

ATG GTG AGO AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 



■i'^ Giy Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

40 45 
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Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAG GGC GTG CAG TGC TTC AGO CGC TAG CCC GAG CAC ATG AAG 24 0 

5 Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAG GTC CAG GAG 28 8 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
10 85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAG AAG ACC CGC GCC GAG 336 

Ara Thr Tie Phe Phe Lvs Asn Aan GTv Asn Tvr T.vk Thr Arcs A^ r GTii 

100 105 110 

15 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 

115 120 125 

20 ATC GAC TTC AAG GAG GAC GGC AAC . ATC CTG GGG CAC AAG CTG GAG TAG 432 
lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAG AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 48 0 

25 Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
30 165 170 175 

GTG CAG CTC GCC GAC CAC TAG CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

35 

CCC GTG CTG CTG CCC GAC AAC CAC TAG CTG AGC ACC CAG TCC GCC CTG 624 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

40 AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 72 0 

45 Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 



55 



Asp Gly Gly Asp Ala Fro Gly Kis Glu Pro Gly Gly Ser Pro Gin A; 



260 



265 



27C 
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Glu Leu Asp Phe Ser lie Leu Phe Asp Tyr Glu Tyr Leu Asn Pro Asn 
275 280 285 

GAA GAA GAG CCG AAT GCA CAT AAG GTC GCC AGC CCA CCC TCC GGA CCC 912 
5 Glu Glu Glu Pro Asn Ala His Lys Val Ala Ser Pro Pro Ser Gly Pro 
290 295 300 

GCA TAC CCC GAT GAT GTA ATG GAC TAT GGC CTC AAG CCA TAG AGC CCC 960 
Ala Tyr Pro Asp Asp Val Met Asp Tyr Gly Leu Lys Pro Tyr Ser Pro 
10 305 310 315 320 

CTT GCT AGT CTC TCT GGC GAG CCC CCC GGC CGA TTC GGA GAG CCG GAT 1008 
Leu Ala Ser Leu Ser Gly Glu Pro Pro Gly Arg Phe Gly Glu Pro Asp 

325 330 335 

15 

AGG GTA GGG CCG CAG AAG TTT CTG AGC GCG GCC AAG CCA GCA GGG GCC 1056 
Arg Val Gly Pro Gin Lys Phe Leu Ser Ala Ala Lys Pro Ala Gly Ala 

340 345 350 

20 TCG GGC CTG AGC CCT CGG ATC GAG ATC ACT CCG TCC CAC GAA CTG ATC 1104 

Ser Gly Leu Ser Pro Arg lie Glu lie Thr Pro Ser His Glu Leu He 
355 360 365 

CAG GCA GTG GGG CCC CTC CGC ATG AGA GAC GCG GGC CTC CTG GTG GAG 1152 
25 Gin Ala Val Gly Pro Leu Arg Met Arg Asp Ala Gly Leu Leu Val Glu 
370 375 380 

CAG CCT CCC CTG GCC GGG GTG GCC GCC AGC CCG AGG TTC ACC CTG CCC 12 0 0 
Gin Pro Pro Leu Ala Gly Val Ala Ala Ser Pro Arg Phe Thr Leu Pro 
30 385 390 395 400 

GTG CCC GGC TTC GAG GGC TAC CGC GAG CCG CTT TGC TTG AGC CCC GCT 12 4 8 

Val Pro Gly Phe Glu Gly Tyr Arg Glu Pro Leu Cys Leu Ser Pro Ala 

405 410 415 

35 

AGC AGC GGC TCC TCT GCC AGC TTC ATT TCT GAC ACC TTC TCC CCC TAC 12 96 

Ser Ser Gly Ser Ser Ala Ser Phe He Ser Asp Thr Phe Ser Pro Tyr 

420 425 430 

40 ACC TCG CCC TGC GTC TCG CCC AAT AAC GGC GGG CCC GAC GAC CTG TGT 13 44 

Thr Ser Pro Cys Val Ser Pro Asn Asn Gly Gly Pro Asp Asp Leu Cys 
435 440 445 

CCG CAG TTT CAA AAC ATC CCT GCT CAT TAT TCC CCC AGA ACC TCG CCA 13 92 
45 Pro Gin Phe Gin Asn He Pro Ala His Tyr Ser Pro Arg Thr Ser Pro 
450 455 460 

ATA ATG TCA CCT CGA ACC AGC CTC GCC GAG GAC AGC TGC CTG GGC CG"^ "^44^ 



55 



i^.^j ^ ^ -cii ii. Lj i\ L L J. ^ ^t_j. Ot:..^ t ^ i. y t-\ ^ ^ 

485 490 495 

AAG C"-^ Ar^"^ '^A" ^CG T^^ ^-^^ ^^^^ ^"^^ -^"^-^ ^^-^ -^"^ ^"^-^ "'^^ 
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Lys Arg Arg His Ser Cys Ala Glu Ala Leu Val Ala Leu Pro Pro Gly 

500 505 510 

GCC TCA CCC CAG CGC TCC CGG AGC CCC TCG CCG CAG CCC TCA TCT CAC 15 84 

5 Ala Ser Pro Gin Arg Ser Arg Ser Pro Ser Pro Gin Pro Ser Ser His 

515 520 525 

GTG GCA CCC CAG GAC CAC GGC TCC CCG GCT GGG TAG CCC CCT GTG GCT 16 3 2 

Val Ala Pro Gin Asp His Gly Ser Pro Ala Gly Tyr Pro Pro Val Ala 

10 530 535 540 

GGC TCT GCC GTG ATC ATG GAT GCC CTG AAC AGC CTC GCC ACG GAC TCG 16 8 0 

Gly Ser Ala Val lie Met Asp Ala Leu Asn Ser Leu Ala Thr Asp Ser 
545 550 555 560 

15 

CCT TGT GGG ATC CCC CCC AAG ATG TGG AAG ACC AGC CCT GAC CCC TCG 172 8 

Pro Cys Gly lie Pro Pro Lys Met Trp Lys Thr Ser Pro Asp Pro Ser 

565 570 575 

20 CCG GTG TCT GCC GCC CCA TCC AAG GCC GGC CTG CCT CGC CAC ATC TAG 177 6 

Pro Val Ser Ala Ala Pro Ser Lys Ala Gly Leu Pro Arg His lie Tyr 

580 585 590 

CCG GCC GTG GAG TTC CTG GGG CCC TGC GAG CAG GGC GAG AGG AGA AAC 182 4 

25 Pro Ala Val Glu Phe Leu Gly Pro Cys Glu Gin Gly Glu Arg Arg Asn 

595 600 605 

TCG GCT CCA GAA TCC ATC CTG CTG GTT CCG CCC ACT TGG CCC AAG CCG 1872 

Ser Ala Pro Glu Ser lie Leu Leu Val Pro Pro Thr Trp Pro Lys Pro 

30 610 615 620 

CTG GTG CCT GCC ATT CCC ATC TGC AGC ATC CCA GTG ACT GCA TCC CTC 192 0 

Leu Val Pro Ala lie Pro lie Cys Ser He Pro Val Thr Ala Ser Leu 
625 630 635 640 

35 

CCT CCA CTT GAG TGG CCG CTG TCC AGT CAG TCA GGC TCT TAC GAG CTG 196 8 

Pro Pro Leu Glu Trp Pro Leu Ser Ser Gin Ser Gly Ser Tyr Glu Leu 

645 650 655 

40 CGG ATC GAG GTG CAG CCC AAG CCA CAT CAC CGG GCC CAC TAT GAG ACA 2 016 

Arg He Glu Val Gin Pro Lys Pro His His Arg Ala His Tyr Glu Thr 

660 665 670 

GAA GGC AGC CGA GGG GCT GTC AAA GCT CCA ACT GGA GGC CAC CCT GTG 2 064 

45 Glu Gly Ser Arg Gly Ala Val Lys Ala Pro Thr Gly Gly His Pro Val 

675 680 685 



GTT CAG CTC CAT OGT '^A^ A'^H HA A AA^ A A'" 



55 



r n e 
705 



lie G 1 >^ 



Asp 
71C 



1 , , 



Arg lie 



Lys Pro His Ala Phe Tvr 



715 



72 
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Gin Val His Arg He Thr Gly Lys Thr Val Thr Thr Thr Ser Tyr Giu 

725 730 735 

AAG ATA GTG GGC AAC ACC AAA GTC CTG GAG ATC CCC TTG GAG CCC AAA 2256 
5 Lys He Val Gly Asn Thr Lys Val Leu Glu He Pro Leu Glu Pro Lys 

740 745 750 

AAC AAC ATG AGG GCA ACC ATC GAC TGT GCG GGG ATC TTG AAG CTT AGA 2 3 04 
Asn Asn Met Arg Ala Thr He Asp Cys Ala Gly He Leu Lys Leu Arg 
10 755 760 765 

AAC GCC GAC ATT GAG CTG CGG AAA GGC GAG ACG GAC ATT GGA AGA AAG 2 3 52 

Asn TkI^ Asp Tie Glu Leu Arg Lys Gly Glu Thr Acp He Cly Arg Lys 
770 775 780 

15 

AAC ACG CGG GTG AGA CTG GTT TTC CGA GTT CAC ATC CCA GAG TCC AGT 240 0 

Asn Thr Arg Val Arg Leu Val Phe Arg Val His He Pro Glu Ser Ser 
785 790 795 800 

20 GGC AGA ATC GTC TCT TTA CAG ACT GCA TCT AAC CCC ATC GAG TGC TCC 244 8 
Gly Arg He Val Ser Leu Gin Thr Ala Ser Asn Pro He Glu Cys Ser 

805 810 815 

CAG CGA TCT GCT CAC GAG CTG CCC ATG GTT GAA AGA CAA GAC ACA GAC 24 96 

25 Gin Arg Ser Ala His Glu Leu Pro Met Val Glu Arg Gin Asp Thr Asp 

820 825 830 

AGC TGC CTG GTC TAT GGC GGC CAG CAA ATG ATC CTC ACG GGG CAG AAC 2544 
Ser Cys Leu Val Tyr Gly Gly Gin Gin Met He Leu Thr Gly Gin Asn 
30 835 840 845 

TTT ACA TCC GAG TCC AAA GTT GTG TTT ACT GAG AAG ACC ACA GAT GGA 25 92 

Phe Thr Ser Glu Ser Lys Val Val Phe Thr Glu Lys Thr Thr Asp Gly 

850 855 860 

35 

CAG CAA ATT TGG GAG ATG GAA GCC ACG GTG GAT AAG GAC AAG AGC CAG 2 64 0 

Gin Gin He Trp Glu Met Glu Ala Thr Val Asp Lys Asp Lys Ser Gin 

865 870 875 880 

40 CCC AAC ATG CTT TTT GTT GAG ATC CCT GAA TAT CGG AAC AAG CAT ATC 2 6 88 
Pro Asn Met Leu Phe Val Glu He Pro Glu Tyr Arg Asn Lys His He 

885 890 895 

CGC ACA CCT GTA AAA GTG AAC TTC TAC GTC ATC AAT GGG AAG AGA AAA 2 7 36 
45 Arg Thr Pro Val Lys Val Asn Phe Tyr Val He Asn Gly Lys Arg Lys 

900 905 910 



55 



Thr Giu Pre Thr Asp Gl 
930 



Tyr Asp Pre Thr Leu He Cy: 
935 940 



Ser Fro Thr 
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His Gly Gly Leu Gly Ser Gin Pro Tyr Tyr Pro Gin His Pro Met Val 
945 950 955 960 

GCC GAG TCC CCC TCC TGC CTC GTG GCC ACC ATG GCT CCC TGC CAG GAG 2 92 8 
5 Ala Glu Ser Pro Ser Cys Leu Val Ala Thr Met Ala Pro Cys Gin Gin 

965 970 975 

TTC CGC ACG GGG CTC TCA TCC CCT GAC GCC CGC TAG CAG CAA CAG AAC 2 976 
Phe Arg Thr Gly Leu Ser Ser Pro Asp Ala Arg Tyr Gin Gin Gin Asn 
10 980 985 990 

CCA GCG GCC GTA CTC TAC CAG CGG AGC AAG AGC CTG AGC CCC AGC CTG 3 024 
Pro Ala Ala Val Leu Tvr Gin Arn Qfi-r T.\rc Coi- T.oii Ca^ n-r-r^ c^->^ t«,, 

995 1000 1005 

15 

CTG GGC TAT CAG CAG CCG GCC CTC ATG GCC GCC CCG CTG TCC CTT GCG 3 072 

Leu Gly Tyr Gin Gin Pro Ala Leu Met Ala Ala Pro Leu Ser Leu Ala 
1010 1015 1020 

20 GAC GCT CAC CGC TCT GTG CTG GTG CAC GCC GGC TCC CAG GGC CAG AGC 3120 
Asp Ala His Arg Ser Val Leu Val His Ala Gly Ser Gin Gly Gin Ser 
1025 1030 1035 1040 

TCA GCC CTG CTC CAC CCC TCT CCG ACC AAC CAG CAG GCC TCG CCT GTG 3168 
25 Ser Ala Leu Leu His Pro Ser Pro Thr Asn Gin Gin Ala Ser Pro Val 

1045 1050 1055 

ATC CAC TAC TCA CCC ACC AAC CAG CAG CTG CGC TGC GGA AGC CAC CAG 3 216 
lie His Tyr Ser Pro Thr Asn Gin Gin Leu Arg Cys Gly Ser His Gin 
30 1060 1065 1070 

GAG TTC CAG CAC ATC ATG TAC TGC GAG AAT TTC GCA CCA GGC ACC ACC 3 2 64 

Glu Phe Gin His lie Met Tyr Cys Glu Asn Phe Ala Pro Gly Thr Thr 
1075 1080 1085 

35 

AGA CCT GGC CCG CCC CCG GTC AGT CAA GGT CAG AGG CTG AGC CCG GGT 3 312 

Arg Pro Gly Pro Pro Pro Val Ser Gin Gly Gin Arg Leu Ser Pro Gly 
1090 1095 1100 

40 TCC TAC CCC ACA GTC ATT CAG CAG CAG AAT GCC ACG AGC CAA AGA GCC 3 3 60 
Ser Tyr Pro Thr Val lie Gin Gin Gin Asn Ala Thr Ser Gin Arg Ala 
1105 1110 1115 1120 

GCC AAA AAC GGA CCC CCG GTC AGT GAC CAA AAG GAA GTA TTA CCT GCG 3 4 08 

45 Ala Lys Asn Gly Pro Pro Val Ser Asp Gin Lys Glu Val Leu Pro Ala 

1125 1130 1135 



As:: \'a.l Asn G^u lie Z^, e Arg Lys Gxu ?ne Ser Gly Pro Pro A^. a Arg 
1155 1160 1165 



55 
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Asn Gin Thr 
1170 



5 (2) INFORMATION FOR SEQ ID NO: 131 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1171 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

{D} TOPOLOGY: linear 

(ii) MOLECULE TYPE' protein 
(v) FRAGMENT TYPE; internal 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131 



Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

15 10 15 

20 Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
25 50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

tr c n f\ n c on 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

30 Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 

115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
35 130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

40 Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
45 210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 



275 28C 285 

Glu Glu Glu Pro Agii Ala His Lys Val Ala Ser Pro Pro Ser Gly Pro 
55 290 295 300 
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10 



15 



20 



25 



30 



35 



40 



45 



305 

Leu Ala Ser 

Arg Val Gly 

Ser Gly Leu 
355 

Gin Ala Val 
370 

Gin Pro Pro 
385 

Val Pro Gly 

Ser Ser Gly 

Thr Ser Pro 
435 

Pro Gin Phe 

450 
lie Met Ser 
465 

His Ser Pro 

Lys Arg Arg 

Ala Ser Pro 
515 

Val Ala Pro 

530 
Gly Ser Ala 
545 

Pro Cys Gly 

Pro Val Ser 

Pro Ala Val 
595 

Ser Ala Pro 

610 

Leu Val Pro 
625 

Pro Pro Leu 

Arg lie Glu 

Glu Gly Ser 
675 



310 

Leu Ser Gly 
325 

Pro Gin Lys 
340 

Ser Pro Arg 

Gly Pro Leu 

Leu Ala Gly 
390 

Phe Glu Gly 

405 
Ser Ser Ala 
420 

Cys Val Ser 

Gin Asn lie 

Pro Arg Thr 
470 

Val Pro Arg 

485 
His Ser Cys 
500 

Gin Arg Ser 

Gin Asp His 

Val He Met 
550 

He Pro Pro 
565 

Ala Ala Pro 
580 

Glu Phe Leu 

Glu Ser He 

Ala He Pro 
630 

Glu Trp Pro 
645 

Val Gin Pro 
660 

Arg Gly Ala 



Glu Pro 

Phe Leu 

He Glu 
360 
Arg Met 
375 

Val Ala 

Tyr Arg 

Ser Phe 

Pro Asn 
440 
Pro Ala 
455 

Ser Leu 

Pro Ala 

Ala Glu 

Arg Ser 
c; o n 

*^ <b W 

Gly Ser 
535 

Asp Ala 

Lys Met 

Ser Lys 

Gly Pro 
600 

Leu Leu 
615 

He Cys 

Leu Ser 

Lys Pro 

Val Lys 
680 



315 

Pro Gly Arg 
330 

Ser Ala Ala 
345 

He Thr Pro 

Arg Asp Ala 

Ala Ser Pro 
395 

Glu Pro Leu 

41 n 
He Ser Asp 
425 

Asn Gly Gly 

His Tyr Ser 

Ala Glu Asp 
475 

Ser Arg Ser 
490 

Ala Leu Val 
505 

Pro Ser Pro 

Pro Ala Gly 

Leu Asn Ser 
555 

Trp Lys Thr 
570 

Ala Gly Leu 
585 

Cys Glu Gin 

Val Pro Pro 

Ser He Pro 
635 

Ser Gin Ser 
650 

His His Arg 
665 

Ala Pro Thr 



Phe Gly Glu 

Lys Pro Ala 
350 

Ser His Glu 
365 

Gly Leu Leu 
380 

Arg Phe Thr 

Cys Leu Ser 

Thr Phe Ser 
430 

Pro Asp Asp 
445 

Pro Arg Thr 
460 

Ser Cys Leu 

Ser Ser Pro 

Ala Leu Pro 
510 

Gin Pro Ser 

ETC 

^ «^ 

Tyr Pro Pro 
540 

Leu Ala Thr 

Ser Pro Asp 

Pro Arg His 
590 

Gly Glu Arg 

605 

Thr Trp Pro 
620 

Val Thr Ala 
Gly Ser Tyr 

Ala His Tyr 

670 

Gly Gly His 
685 



320 
Pro Asp 
335 

Gly Ala 

Leu He 

Val Glu 

Leu Pro 
400 
Pro Ala 
415 

Pro Tyr 

Leu Cys 

Ser Pro 

Gly Arg 
480 
Gly Ala 
495 

Pro Gly 

Ser His 

Val Ala 

Asp Ser 
560 
Pro Ser 
575 

He Tyr 

Arg Asn 

Lys Pro 

Ser Leu 
640 
Glu Leu 
655 

Glu Thr 
Pro Val 



55 



"7 " r 



le Veil 



Gly Asn Th 
740 



730 ^35 
Lys Val Leu Glu He Fro Leu Glu Pro Lys 
745 750 
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755 760 765 

Asn Ala Asp lie Glu Leu Arg Lys Gly Glu Thr Asp lie Gly Arg Lys 

770 775 780 

Asn Thr Arg Val Arg Leu Val Phe Arg Val His He Pro Glu Ser Ser 
5 785 790 795 800 

Gly Arg He Val Ser Leu Gin Thr Ala Ser Asn Pro He Glu Cys Ser 

805 810 815 

Gin Arg Ser Ala His Glu Leu Pro Met Val Glu Arg Gin Asp Thr Asp 

820 825 830 

10 Ser Cys Leu Val Tyr Gly Gly Gin Gin Met He Leu Thr Gly Gin Asn 

835 840 845 

Phe Thr Ser Glu Ser Lys Val Val Phe Thr Glu Lys Thr Thr Asp Gly 

850 855 860 

Gin Gin He Trp Glu Met Glu Ala Thr Val Asp Lys Asp Lys Ser Gin 
15 865 870 875 880 

Pro Asn Met Leu Phe Val Glu He Pro Glu Tyr Arg Asn Lys His He 

885 890 895 

Arg Thr Pro Val Lys Val Asn Phe Tyr Val He Asn Gly Lys Arg Lys 

900 905 910 

20 Arg Ser Gin Pro Gin His Phe Thr Tyr His Pro Val Pro Ala He Lys 

915 920 925 

Thr Glu Pro Thr Asp Glu Tyr Asp Pro Thr Leu He Cys Ser Pro Thr 

930 935 940 

His Gly Gly Leu Gly Ser Gin Pro Tyr Tyr Pro Gin His Pro Met Val 
25 945 950 955 960 

Ala Glu Ser Pro Ser Cys Leu Val Ala Thr Met Ala Pro Cys Gin Gin 

965 970 375 

Phe Arg Thr Gly Leu Ser Ser Pro Asp Ala Arg Tyr Gin Gin Gin Asn 

980 985 990 

30 Pro Ala Ala Val Leu Tyr Gin Arg Ser Lys Ser Leu Ser Pro Ser Leu 

995 1000 1005 

Leu Gly Tyr Gin Gin Pro Ala Leu Met Ala Ala Pro Leu Ser Leu Ala 

1010 1015 1020 

Asp Ala His Arg Ser Val Leu Val His Ala Gly Ser Gin Gly Gin Ser 
35 025 1030 1035 1040 

Ser Ala Leu Leu His Pro Ser Pro Thr Asn Gin Gin Ala Ser Pro Val 

1045 1050 1055 

He His Tyr Ser Pro Thr Asn Gin Gin Leu Arg Cys Gly Ser His Gin 

1060 1065 1070 

40 Glu Phe Gin His He Met Tyr Cys Glu Asn Phe Ala Pro Gly Thr Thr 

1075 1080 1085 

Arg Pro Gly Pro Pro Pro Val Ser Gin Gly Gin Arg Leu Ser Pro Gly 

1090 1095 1100 

Ser Tyr Pro Thr Val He Gin Gin Gin Asn Ala Thr Ser Gin Arg Ala 
45 105 1110 1115 1120 

Ala Lys Asn Gly Pro Pro Val Ser Asp Gin Lys Glu Val Leu Pro Ala 

1125 1130 1135 

Glv Val Th'T' 1 ^. - - T^.^- ■ ■ 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3546 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

10 (A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .3543 
(D) OTHER INFORMATION: 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

ATG AAC GCC CCC GAG CGG CAG CCC CAA CCC GAC GGC GGG GAC GCC CCA 4 8 

Met Asn Ala Pro Glu Arg Gin Pro Gin Pro Asp Gly Gly Asp Ala Pro 
15 10 15 

20 GGC CAC GAG CCT GGG GGC AGC CCC CAA GAC GAG CTT GAC TTC TCC ATC 96 
Gly His Glu Pro Gly Gly Ser Pro Gin Asp Glu Leu Asp Phe Ser lie 

20 25 30 

CTC TTC GAC TAT GAG TAT TTG AAT CCG AAC GAA GAA GAG CCG AAT GCA 14 4 

25 Leu Phe Asp Tyr Glu Tyr Leu Asn Pro Asn Glu Glu Glu Pro Asn Ala 

35 40 45 

CAT AAG GTC GCC AGC CCA CCC TCC GGA CCC GCA TAC CCC GAT GAT GTA 192 
His Lys Val Ala Ser Pro Pro Ser Gly Pro Ala Tyr Pro Asp Asp Val 
30 50 55 60 

ATG GAC TAT GGC CTC AAG CCA TAC AGC CCC CTT GCT AGT CTC TCT GGC 24 0 

Met Asp Tyr Gly Leu Lys Pro Tyr Ser Pro Leu Ala Ser Leu Ser Gly 
65 70 75 80 

35 

GAG CCC CCC GGC CGA TTC GGA GAG CCG GAT AGG GTA GGG CCG CAG AAG 288 

Glu Pro Pro Gly Arg Phe Gly Glu Pro Asp Arg Val Gly Pro Gin Lys 

85 90 95 

40 TTT CTG AGC GCG GCC AAG CCA GCA GGG GCC TCG GGC CTG AGC CCT CGG 3 36 

Phe Leu Ser Ala Ala Lys Pro Ala Gly Ala Ser Gly Leu Ser Pro Arg 

100 105 110 

ATC GAG ATC ACT CCG TCC CAC GAA CTG ATC CAG GCA GTG GGG CCC CTC 3 84 

45 lie Glu He Thr Pro Ser His Glu Leu He Gin Ala Val Gly Pro Leu 

115 120 125 

'^^.r- A""^ A^A HA'^ ^.^n ^.^.^ ^r^^ r^'^r^ '-•^^ -^^ ^ ^-.^ ^^.^ , - - 



a i A 1 a A .1 d Ser- r r o Arg P h e 



15 



I e VI Pro V a 
15 



Pro Giy V 



= ^-e Glu Gly 
160 
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Tyr Arg Glu Pro Leu Cys Leu Ser Pro Ala Ser Ser Gly Ser Ser Ala 

165 170 175 

AGC TTC ATT TCT GAG ACC TTC TCC CCC TAG ACC TCG CCC TGC GTC TCG 5 76 

5 Ser Phe He Ser Asp Thr Phe Ser Pro Tyr Thr Ser Pro Cys Val Ser 

180 185 190 

CCC AAT AAC GGC GGG CCC GAC GAC CTG TGT CCG CAG TTT CAA AAC ATC 62 4 

Pro Asn Asn Gly Gly Pro Asp Asp Leu Cys Pro Gin Phe Gin Asn He 
10 195 200 205 

CCT GCT CAT TAT TCC CCC AGA ACC TCG CCA ATA ATG TCA CCT CGA ACC 6 72 

Pro Ala His Tyr Ser Pro Arg Thr Spr Pro He Met Ser Pre Arg Thr 

210 215 220 

15 

AGC CTC GCC GAG GAC AGC TGC CTG GGC CGC CAC TCG CCC GTG CCC CGT 72 0 

Ser Leu Ala Glu Asp Ser Cys Leu Gly Arg His Ser Pro Val Pro Arg 

225 230 235 240 

20 CCG GCC TCC CGC TCC TCA TCG CCT GGT GCC AAG CGG AGG CAT TCG TGC 768 
Pro Ala Ser Arg Ser Ser Ser Pro Gly Ala Lys Arg Arg His Ser Cys 

245 250 255 

GCC GAG GCC TTG GTT GCC CTG CCG CCC GGA GCC TCA CCC CAG CGC TCC 816 
25 Ala Glu Ala Leu Val Ala Leu Pro Pro Gly Ala Ser Pro Gin Arg Ser 

260 265 270 

CGG AGC CCC TCG CCG CAG CCC TCA TCT CAC GTG GCA CCC CAG GAC CAC 8 64 

Arg Ser Pro Ser Pro Gin Pro Ser Ser His Val Ala Pro Gin Asp His 
30 275 280 285 

GGC TCC CCG GCT GGG TAG CCC CCT GTG GCT GGC TCT GCC GTG ATC ATG 912 

Gly Ser Pro Ala Gly Tyr Pro Pro Val Ala Gly Ser Ala Val He Met 

290 295 300 

35 

GAT GCC CTG AAC AGC CTC GCC ACG GAC TCG CCT TGT GGG ATC CCC CCC 96 0 

Asp Ala Leu Asn Ser Leu Ala Thr Asp Ser Pro Cys Gly He Pro Pro 

305 310 315 320 

40 AAG ATG TGG AAG ACC AGC CCT GAC CCC TCG CCG GTG TCT GCC GCC CCA 10 0 8 
Lys Met Trp Lys Thr Ser Pro Asp Pro Ser Pro Val Ser Ala Ala Pro 

325 330 335 

TCC AAG GCC GGC CTG CCT CGC CAC ATC TAC CCG GCC GTG GAG TTC CTG 105 6 
45 Ser Lys Ala Gly Leu Pro Arg His He Tyr Pro Ala Val Glu Phe Leu 

340 345 350 
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Leu Leu Va; Pre Pre T:;r Trp Pro Lys Pre Leu Val Pro Ala He Pro 
370 375 380 
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He Cys Ser He Pro Val Thr Ala Ser Leu Pro Pro Leu Glu Trp Pro 
385 390 395 

CTG TCC AGT CAG TCA GGC TCT TAG GAG CTG CGG ATC GAG GTG CAG CCC 1248 

5 Leu Ser Ser Gin Ser Gly Ser Tyr Glu Leu Arg lie Glu Val Gin Pro 

405 410 415 

AAG CCA CAT CAC CGG GCC CAC TAT GAG ACA GAA GGC AGC CGA GGG GCT 12 96 

Lys Pro His His Arg Ala His Tyr Glu Thr Glu Gly Ser Arg Gly Ala 

10 420 425 430 

GTC AAA GCT CCA ACT GGA GGC CAC CCT GTG GTT CAG CTC CAT GGC TAC 1344 

vai L,ys Ala Pro Thr Gly Gly His Pro Val Val Gin Leu His Gly Tyr 
435 440 445 

15 

ATG GAA AAC AAG CCT CTG GGA CTT CAG ATC TTC ATT GGG ACA GCT GAT 13 92 

Met Glu Asn Lys Pro Leu Gly Leu Gin He Phe He Gly Thr Ala Asp 
450 455 460 

20 GAG CGG ATC CTT AAG CCG CAC GCC TTC TAC CAG GTG CAC CGA ATC ACG 1440 

Glu Arg He Leu Lys Pro His Ala Phe Tyr Gin Val His Arg He Thr 
465 470 475 480 



25 



35 



40 



45 



GGG AAA ACT GTC ACC ACC ACC AGC TAT GAG AAG ATA GTG GGC AAC ACC 1488 
Gly Lys Thr Val Thr Thr Thr Ser Tyr Glu Lys He Val Gly Asn Thr 

485 490 495 



AAA GTC CTG GAG ATC CCC TTG GAG CCC AAA AAC AAC ATG AGG GCA ACC 153 6 

Lys Val Leu Glu He Pro Leu Glu Pro Lys Asn Asn Met Arg Ala Thr 

30 500 505 510 

ATC GAC TGT GCG GGG ATC TTG AAG CTT AGA AAC GCC GAC ATT GAG CTG 15 84 

He Asp Cys Ala Gly He Leu Lys Leu Arg Asn Ala Asp He Glu Leu 
515 520 525 



CGG AAA GGC GAG ACG GAC ATT GGA AGA AAG AAC ACG CGG GTG AGA CTG 1632 

Arg Lys Gly Glu Thr Asp He Gly Arg Lys Asn Thr Arg Val Arg Leu 

530 535 540 

GTT TTC CGA GTT CAC ATC CCA GAG TCC AGT GGC AGA ATC GTC TCT TTA 1680 

Val Phe Arg Val His He Pro Glu Ser Ser Gly Arg He Val Ser Leu 
545 550 555 560 

CAG ACT GCA TCT AAC CCC ATC GAG TGC TCC CAG CGA TCT GCT CAC GAG 172 8 

Gin Thr Ala Ser Asn Pro He Glu Cys Ser Gin Arg Ser Ala His Glu 

565 570 575 
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Gly Gin Gin Met He Leu Thr Gly Gin Asn Phe Thr Ser Glu Ser Lys 
595 600 605 
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Val Val Phe Thr Glu Lys Thr Thr Asp Gly Gin Gin lie Trp Glu Met 
610 615 620 

GAA GCC ACQ GTG GAT AAG GAG AAG AGO CAG CCC AAC ATG CTT TTT GTT 192 0 
5 Glu Ala Thr Val Asp Lys Asp Lys Ser Gin Pro Asn Met Leu Phe Val 
625 630 635 640 

GAG ATC OCT GAA TAT CGG AAC AAG CAT ATC CGC ACA CCT GTA AAA GTG 196 8 

Glu lie Pro Glu Tyr Arg Asn Lys His He Arg Thr Pro Val Lys Val 
10 645 650 655 

AAC TTC TAG GTC ATC AAT GGG AAG AGA AAA CGA AGT CAG CCT CAG CAC 2 016 

^^xi t xi.^ Aj^A. J.J.C: oxy uya g ijya Krg iaer KjlU FrO Uln HIS 

660 665 670 

15 

TTT ACC TAG CAC CCA GTC CCA GCC ATC AAG ACG GAG CCC ACG GAT GAA 2 064 

Phe Thr Tyr His Pro Val Pro Ala He Lys Thr Glu Pro Thr Asp Glu 

675 680 685 

20 TAT GAC CCC ACT CTG ATC TGC AGC CCC ACC CAT GGA GGC CTG GGG AGC 2112 
Tyr Asp Pro Thr Leu He Cys Ser Pro Thr His Gly Gly Leu Gly Ser 
690 695 700 



25 



CAG CCT TAC TAG CCC CAG CAC CCG ATG GTG GCC GAG TCC CCC TCC TGC 2160 
Gin Pro Tyr Tyr Pro Gin His Pro Met Val Ala Glu Ser Pro Ser Cys 
705 710 715 720 



GTC GTG GCC ACC ATG GCT CCC TGC CAG CAG TTC CGC ACG GGG CTC TCA 22 0 8 
Leu Val Ala Thr Met Ala Pro Cys Gin Gin Phe Arg Thr Gly Leu Ser 
30 725 730 735 

TCC CCT GAC GCC CGC TAC CAG CAA CAG AAC CCA GCG GCC GTA CTC TAC 2 25 6 

Ser Pro Asp Ala Arg Tyr Gin Gin Gin Asn Pro Ala Ala Val Leu Tyr 

740 745 750 

35 

CAG CGG AGC AAG AGC CTG AGC CCC AGC CTG CTG GGC TAT CAG CAG CCG 2 3 04 

Gin Arg Ser Lys Ser Leu Ser Pro Ser Leu Leu Gly Tyr Gin Gin Pro 
755 760 765 

40 GCC CTC ATG GCC GCC CCG CTG TCC CTT GCG GAC GCT CAC CGC TCT GTG 235 2 

Ala Leu Met Ala Ala Pro Leu Ser Leu Ala Asp Ala His Arg Ser Val 
770 775 780 

CTG GTG CAC GCC GGC TCC CAG GGC CAG AGC TCA GCC CTG CTC CAC CCC 24 0 0 

45 Leu Val His Ala Gly Ser Gin Gly Gin Ser Ser Ala Leu Leu His Pro 
785 790 795 800 



Asn Gin Gin Leu Arg Cys Gly Ser His Gin Glu Phe Gin His He Met 

820 825 B30 
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Tyr Cys Glu Asn Phe Ala Pro Gly Thr Thr Arg Pro Gly Pro Pro Pro 
835 840 845 

GTC AGT CAA GGT CAG AGG CTG AGC CCG GGT TCC TAG CCC ACA GTC ATT 2 5 92 
5 Val Ser Gin Gly Gin Arg Leu Ser Pro Gly Ser Tyr Pro Thr Val lie 
850 855 860 

CAG CAG CAG AAT GCC ACG AGC CAA AGA GCC GCC AAA AAC GGA CCC CCG 2 64 0 
Gin Gin Gin Asn Ala Thr Ser Gin Arg Ala Ala Lys Asn Gly Pro Pro 
10 865 870 875 880 

GTC AGT GAC CAA AAG GAA GTA TTA CCT GCG GGG GTG ACC ATT AAA CAG 268 8 

Val Ser Asp Gin Lys Glu Val Leu Pro Ala Gly Val Thi He Lys Gin 

885 890 895 

15 

GAG CAG AAC TTG GAC CAG ACC TAC TTG GAT GAT GTT AAT GAA ATT ATC 273 6 

Glu Gin Asn Leu Asp Gin Thr Tyr Leu Asp Asp Val Asn Glu He He 

900 905 910 

20 AGG AAG GAG TTT TCA GGA CCT CCT GCC AGA AAT CAG ACG AGA ATT CTG 2 7 84 
Arg Lys Glu Phe Ser Gly Pro Pro Ala Arg Asn Gin Thr Arg He Leu 
915 920 925 



25 



45 



CAG TCG ACG GTA CCG CGG GCC CGG GAT CCA CCG GTC GCC ACC ATG GTG 2 83 2 

Gin Ser Thr Val Pro Arg Ala Arg Asp Pro Pro Val Ala Thr Met Val 
930 935 940 



AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC GAG 2 880 

Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu 

30 945 950 955 960 

CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG GGC 2 92 8 

Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly 

965 970 975 

35 

GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC ACC 2 976 

Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr 

980 985 990 

40 ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG ACC 3 024 

Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr 
995 1000 1005 



TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG CAG CAC 3 072 

Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His 
1010 1015 1020 
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le Phe Phe Lys Asp Asp Gly Asn lyr Lys Thr Arg Ala Glu Val Lys 

1045 1050 1055 
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Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly He Asp 

1060 1065 1070 

TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAG AAC TAG 3 2 64 

5 Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn Tyr 

1075 1080 1085 

AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC ATC 3 312 

Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He 
10 1090 1095 1100 

AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG CAG 3 3 60 

Lys Val Asn Phe Lys He Arg His Asn He Glu Asn Gly Ser Val Gin 
1105 1110 1115 ' ' 1120 

15 

CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC GTG 34 0 8 

Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val 

1125 1130 1135 

20 CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG AGC AAA 3 45 6 

Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys 

1140 1145 1150 



25 



40 



45 



GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG ACC 35 04 
Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr 
1155 1160 1165 



GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TAA 3 54 6 

Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lye 
30 1170 1175 1180 



(2) INFORMATION FOR SEQ ID NO: 13 3 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 3: 

Met Asn Ala Pro Glu Arg Gin Pro Gin Pro Asp Gly Gly Asp Ala Pro 
15 10 15 

Glv His Glu Pro Glv Giv Ff^r Pr^ A--^ " " - 



55 



50 

Ket Asp Ty 
65 



55 60 
Gly Leu Lys Fro Tyr Ser Pro Leu Ala Se; 
70 75 



Leu Ser Gly 
80 



r ----- 
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10 



15 



20 



25 



30 



35 



40 



45 



Phe 

He 

Arg 

Val 
145 
Tyr 

Ser 

Pro 

Pro 

Ser 
225 
Pro 

Ala 

Arg 

Gly 

Asp 
305 
Lys 

Ser 

Gly 

Leu 

He 
385 
Leu 

Lys 

Val 

Met 

Glu 



85 

Leu Ser Ala Ala 
100 

Glu He Thr Pro 
115 

Met Arg Asp Ala 
13 0 

Ala Ala Ser Pro 

Arg Glu Pro Leu 

165 

Phe He Ser Asp 
180 

Asn Asn Gly Gly 
195 

Ala His Tyr Ser 
210 

Leu Ala Glu Asp 

Ala Ser Arg Ser 

245 

Glu Ala Leu Val 
260 

Ser Pro Ser Pro 
275 

Ser Pro Ala Gly 
290 

Ala Leu Asn Ser 

Met Trp Lys Thr 

325 

Lys Ala Gly Leu 
340 

Pro Cys Glu Gin 
355 

Leu Val Pro Pro 
370 

Cys Ser He Pro 

Ser Ser Gin Ser 

405 

Pro His His Arg 
420 

Lys Ala Pro Thr 
435 

Glu Asn Lye Pro 
450 

Aro He Leu r.vs 



Lys 

Ser 

Gly 

Arg 
150 

Cys 

Thr 

Pro 

Pro 

Ser 
230 
Ser 



90 

Pro Ala Gly Ala 
105 

His Glu Leu He 
120 

Leu Leu Val Glu 
135 

Phe Thr Leu Pro 

Leu Ser Pro Ala 

170 

Phe Ser Pro Tyr 
185 

Asp Asp Leu Cys 
200 

Arg Thr Ser Pro 
215 

Cys Leu Gly Arg 



Ser Pro 



Ala Leu Pro 
Gin 
Tyr 



Leu 
310 
Ser 

Pro 

Gly 

Thr 

Val 
390 
Gly 

Ala 

Gly 

Leu 



Pro Ser 
280 
Pro Pro 
295 

Ala Thr 



Gly Ala 
250 
Pro Gly 
265 

Ser His 
Val Ala 
Asp Ser 



Pro Asp Pro Ser 

330 

Arg His He Tyr 
345 

Glu Arg Arg Asn 
360 

Trp Pro Lys Pro 
375 

Thr Ala Ser Leu 

Ser Tyr Glu Leu 

410 

His Tyr Glu Thr 
425 

Gly His Pro Val 
440 

Gly Leu Gin He 
455 



Ser Gly 

Gin Ala 

Gin Pro 
140 
Val Pro 
155 

Ser Ser 

Thr Ser 

Pro Gin 

He Met 
220 
His Ser 
235 

Lys Arg 
Ala Ser 
Val Ala 
Gly Ser 

A A 
J W U 

Pro Cys 
315 

Pro Val 

Pro Ala 

Ser Ala 

Leu Val 
380 
Pro Pro 
395 

Arg He 

Glu Gly 

Val Gin 

Phe He 
460 



95 

Leu Ser Pro Arg 
110 

Val Gly Pro Leu 
125 

Pro Leu Ala Gly 

Gly Phe Glu Gly 

160 

Gly Ser Ser Ala 
175 

Pro Cys Val Ser 



inn 
X ^ u 



Phe Gin Asn He 
205 

Ser Pro Arg Thr 

Pro Val Pro Arg 

240 

Arg His Ser Cys 
255 

Pro Gin Arg Ser 
270 

Pro Gin Asp His 
285 

Ala Val He Met 

Gly He Pro Pro 

320 

Ser Ala Ala Pro 
335 

Val Glu Phe Leu 
350 

Pro Glu Ser He 
365 

Pro Ala He Pro 

Leu Glu Trp Pro 

400 

Glu Val Gin Pro 
415 

Ser Arg Gly Ala 
430 

Leu His Gly Tyr 
445 

Gly Thr Ala Asp 



Pro Hir A^. -) V 
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500 

le Asp Cys Ala Gly He 
515 



505 

,eu Lys Leu Arg As: 
520 



510 

Ala Asp He Glu Le- 
525 



A: 
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10 



15 



20 



25 



30 



35 



40 



45 



530 
Val Phe 
545 

Gin Thr 

Leu Pro 

Gly Gin 

Val Val 
610 
Glu Ala 
625 

Glu He 

Asn Phe 

Phe Thr 

Tyr Asp 
690 
Gin Pro 
705 

Leu Val 

Ser Pro 

Gin Arg 

Ala Leu 
770 
Leu Val 
785 

Ser Pro 

Asn Gin 

Tyr Cys 

Val Ser 
850 
Gin Gin 
865 

Val Ser 

Glu Gin 



Arg Val His He 

550 

Ala Ser Asn Pro 
565 

Met Val Glu Arg 
580 

Gin Met He Leu 
595 

Phe Thr Glu Lys 

Thr Val Asp Lys 

63 0 

Pro Glu Tyr Arg 
645 

Tyr Val He Asn 
660 

Tyr His Pro Val 
675 

Pro Thr Leu He 

Tyr Tyr Pro Gin 

710 

Ala Thr Met Ala 
725 

Asp Ala Arg Tyr 

Ser Lys Ser Leu 
755 

Met Ala Ala Pro 

His Ala Gly Ser 

790 

Thr Asn Gin Gin 
805 

Gin Leu Arg Cys 
620 

Glu Asn Phe Ala 
835 

Gin Gly Gin Arg 

Gin Asn Ala Thr 

870 

Asp Gin Lys Glu 
885 

Asn Leu Asp Gin 
900 



535 

Pro Glu Ser 

He Glu Cys 

Gin Asp Thr 
585 

Thr Gly Gin 
600 

Thr Thr Asp 
615 

Asp Lys Ser 

Asn Lys His 

Gly Lys Arg 
665 

Pro Ala He 
680 

Cys Ser Pro 
695 

His Pro Met 
Pro Cys Gin 

Gin Gin Gin 
745 

Ser Pro Ser 
760 

Leu Ser Leu 
775 

Gin Gly Gin 

Ala Ser Pro 

Gly Ser His 
825 

Pro Gly Thr 
840 

Leu Ser Pro 
855 

Ser Gin Arg 

Val Leu Pro 

Thr Tyr Leu 
905 



540 

Ser Gly Arg He 
555 

Ser Gin Arg Ser 
570 

Asp Ser Cys Leu 

Asn Phe Thr Ser 

605 

Gly Gin Gin He 
620 

Gin Pro Asn Met 

^ ^ mm 

O J s 

He Arg Thr Pro 
650 

Lys Arg Ser Gin 

Lys Thr Glu Pro 

685 

Thr His Gly Gly 
700 

Val Ala Glu Ser 
715 

Gin Phe Arg Thr 
730 

Asn Pro Ala Ala 

Leu Leu Gly Tyr 

765 

Ala Asp Ala His 
780 

Ser Ser Ala Leu 
795 

Val He His Tyr 
810 

Gin Glu Phe Gin 

Thr Arg Pro Gly 

845 

Gly Ser Tyr Pro 
860 

Ala Ala Lys Asn 
875 

Ala Gly Val Thr 
890 

Asp Asp Val Asn 



Val 

Ala 

Val 
590 
Glu 

Trp 

Leu 

Val 

Pro 
670 

Thr 

Leu 

Pro 

Gly 

Val 
750 
Gin 

Arg 

Leu 

Ser 

His 
830 
Pro 

Thr 

Gly 

He 

Glu 
910 



Ser Leu 
560 
His Glu 
575 

Tyr Gly 

Ser Lys 

Glu Met 

Phe Val 
640 
Lys Val 
655 

Gin His 

Asp Glu 

Gly Ser 

Ser Cys 
720 
Leu Ser 
735 

Leu Tyr 

Gin Pro 

Ser Val 

His Pro 
800 
Pro Thr 
815 

He Met 

Pro Pro 

Val He 

Pro Pro 
880 

Lys Gin 
895 

He He 



55 



1 * . Ci ^ 



_ e u . 



S50 955 960 

Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly 

965 970 9-75 
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980 985 990 

Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr 

995 1000 1005 

Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His 

1010 1015 1020 

Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr 
025 1030 1035 1040 

lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys 

1045 1050 1055 

Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie Asp 

1060 1065 1070 

Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn Tyr 

1075 lose 1055 

Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He 
15 1090 1095 1100 

Lys Val Asn Phe Lys He Arg His Asn lie Glu Asp Gly Ser Val Gin 
105 1110 1115 1120 

Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val 

1125 1130 1135 

20 Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys 

1140 1145 1150 

Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr 

1155 1160 1165 

Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
25 1170 1175 1180 

{2} INFOP.MATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 2802 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 {ii} MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .2799 
40 (D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 



45 



ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 



r^J^ ...J .... - - : ./-L . .... ^. J. vJ M./A^7 1 ^ w /A i '1 *1 



Glu Gly Glu Gly Asp Ala Thr T\^r Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 



55 



SUBSTITUTE SHECT (RULE 26) 



wo 98/45704 



PCT/DK98/00145 



270 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

CTG ACC TAG GGC GTG CAG TGC TTC AGC CGC TAG CCC GAG CAC ATG AAG 24 0 

5 Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAG GTC CAG GAG 288 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
10 85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAG AAG ACC CGC GCC GAG 336 

Arg Thr lie Phe Phe T,ys Asp Asp Cly Asn T-yr Lys Thr Arg Ala Glu 

100 105 110 

15 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

20 ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAG 43 2 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 

130 135 140 

AAC TAG AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 48 0 

25 Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
30 165 170 175 

GTG CAG CTC GCC GAC CAC TAG CAG CAG AAC ACC CCC ATC GGC GAC GGC 57 6 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

35 

CCC GTG CTG CTG CCC GAC AAC CAC TAG CTG AGC ACC CAG TCC GCC CTG 6 24 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

40 AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 67 2 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAG AAG TCC 72 0 

45 Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

nrjA ^'^"^ Ann ' — - — - 
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Aia Leu Gin Glu Lys He Glu Glu Leu Arg Gin Arg Asp Ala Leu He 

26C 265 270 
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Asp Glu Leu Glu Leu Glu Leu Asp Gin Lys Asp Glu Leu He Gin Lys 

275 280 285 

CTG CAG AAC GAG CTG GAG AAG TAG CGC TCG GTG ATC CGA CCA GCC ACC 912 

5 Leu Gin Asn Glu Leu Asp Lys Tyr Arg Ser Val lie Arg Pro Ala Thr 

290 295 300 

CAG CAG GCG CAG AAG CAG AGC GCG AGC ACC TTG CAG GGC GAG CCG CGC 960 

Gin Gin Ala Gin Lys Gin Ser Ala Ser Thr Leu Gin Gly Glu Pro Arg 

10 305 310 315 320 

ACC AAG CGG CAG GCG ATC TCC GCC GAG CCC ACC GCC TTC GAG ATC CAG 10 0 6 

Thr Lys Arq Gin Ala He Ser Ala oiu Pre Thr Ala The Asp He Gin 

325 330 335 

15 

GAT CTC AGC CAT GTG ACC CTG CCC TTC TAC CCC AAG AGC CCA CAG TCC 10 5 6 

Asp Leu Ser His Val Thr Leu Pro Phe Tyr Pro Lys Ser Pro Gin Ser 

340 345 350 

20 AAG GAT CTT ATA AAG GAA GCT ATC CTT GAC AAT GAG TTT ATG AAG AAC 1104 

Lys Asp Leu lie Lys Glu Ala He Leu Asp Asn Asp Phe Met Lys Asn 

355 360 365 



25 



45 



TTG GAG CTG TCG CAG ATC CAG GAG ATT GTG GAT TGT ATG TAC CCG GTG 1152 
Leu Glu Leu Ser Gin He Gin Glu He Val Asp Cys Met Tyr Pro Val 
370 375 380 



GAG TAT GGC AAG GAC AGT TGC ATC ATC AAA GAA GGA GAC GTG GGG TCA 12 00 

Glu Tyr Gly Lys Asp Ser Cys He He Lys Glu Gly Asp Val Gly Ser 

30 385 390 395 400 

CTG GTG TAT GTC ATG GAA GAT GGT AAG GTT GAA GTT ACA AAA GAA GGT 124 8 

Leu Val Tyr Val Met Glu Asp Gly Lys Val Glu Val Thr Lys Glu Gly 

405 410 415 

35 

GTG AAG TTG TGT ACC ATG GGT CCA GGA AAA GTG TTT GGG GAA TTG GCT 12 96 

Val Lys Leu Cys Thr Met Gly Pro Gly Lys Val Phe Gly Glu Leu Ala 

420 425 430 

40 ATT CTT TAC AAC TGT ACC CGG ACA GCG ACC GTC AAG ACT CTT GTA AAT 1344 

He Leu Tyr Asn Cys Thr Arg Thr Ala Thr Val Lys Thr Leu Val Asn 

435 440 445 



GTA AAA CTC TGG GCC ATT GAT CGA CAA TGT TTT CAA ACA ATA ATG ATG 13 92 

Val Lys Leu Trp Ala He Asp Arg Gin Cys Phe Gin Thr He Met Met 
450 455 460 



AGG ACA r^'^^ 



Val. Pre Thr Fhe Gin Ser Leu Pro Glu Glu lie Leu Ser Lys Leu Ala 

485 49C 495 
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Asp Val Leu Glu Glu Thr His Tyr Glu Asn Gly Glu Tyr lie He Arg 

500 505 510 

CAA GGT GCA AGA GGG GAC ACC TTC TTT ATC ATC AGC AAA GGA ACG GTA 15 84 
5 Gin Gly Ala Arg Gly Asp Thr Phe Phe He He Ser Lys Gly Thr Val 

515 520 525 

AAT GTC ACT CGT GAA GAC TCA CCG AGT GAA GAC CCA GTC TTT CTT AGA 163 2 
Asn Val Thr Arg Glu Asp Ser Pro Ser Glu Asp Pro Val Phe Leu Arg 
10 530 535 540 

ACT TTA GGA AAA GGA GAC TGG TTT GGA GAG AAA GCC TTG CAG GGG GAA 16 8 0 

Thr Leu Gly Lys Gly Asp Trp Php my oiu Lys Ala Leu Gin Gly Glu 

545 550 555 560 

15 

GAT GTG AGA ACA GCA AAC GTA ATT GCT GCA GAA GCT GTA ACC TGC CTT 172 8 

Asp Val Arg Thr Ala Asn Val He Ala Ala Glu Ala Val Thr Cys Leu 

565 570 575 

20 GTG ATT GAC AGA GAC TCT TTT AAA CAT TTG ATT GGA GGG CTG GAT GAT 1776 
Val He Asp Arg Asp Ser Phe Lys His Leu He Gly Gly Leu Asp Asp 

580 585 590 

GTT TCT AAT AAA GCA TAT GAA GAT GCA GAA GCT AAA GCA AAA TAT GAA 1824 
25 Val Ser Asn Lys Ala Tyr Glu Asp Ala Glu Ala Lys Ala Lys Tyr Glu 

595 600 605 

GCT GAA GCG GCT TTC TTC GCC AAC CTG AAG CTG TCT GAT TTC AAC ATC 1872 
Ala Glu Ala Ala Phe Phe Ala Asn Leu Lys Leu Ser Asp Phe Asn He 
30 610 615 620 

ATT GAT ACC CTT GGA GTT GGA GGT TTC GGA CGA GTA GAA CTG GTC CAG 192 0 

He Asp Thr Leu Gly Val Gly Gly Phe Gly Arg Val Glu Leu Val Gin 

625 630 635 640 

35 

TTG AAA AGT GAA GAA TCC AAA ACG TTT GCA ATG AAG ATT CTC AAG AAA 196 8 

Leu Lys Ser Glu Glu Ser Lys Thr Phe Ala Met Lys He Leu Lys Lys 

645 650 655 

40 CGT CAC ATT GTG GAC ACA AGA CAG CAG GAG CAC ATC CGC TCA GAG AAG 2 016 

Arg His He Val Asp Thr Arg Gin Gin Glu His He Arg Ser Glu Lys 

660 665 670 

CAG ATC ATG CAG GGG GCT CAT TCC GAT TTC ATA GTG AGA CTG TAG AGA 2 064 
45 Gin He Met Gin Gly Ala His Ser Asp Phe He Val Arg Leu Tyr Arg 

675 680 685 

ACA TTT AAG GAC. A-C AAA TA" '"^^^ '^r '^ " — ""^^ '"^ ' - ^'"^ ■ ■ - 



viiy Gly Gxu Leu Trp Thr He Leu Arg Asp Arg Gly Ser Phe Glu Asp 
'^OS 71C 715 720 
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Ser Thr Thr Arg Phe Tyr Thr Ala Cys Val Val Glu Ala Phe Ala Tyr 

725 730 735 

CTG CAT TCC AAA GGA ATC ATT TAC AGG GAC CTC AAG CCA GAA AAT CTC 225 6 

5 Leu His Ser Lys Gly lie He Tyr Arg Asp Leu Lys Pro Glu Asn Leu 

740 745 750 

ATC CTA GAT CAC CGA GGT TAT GCC AAA CTG GTT GAT TTT GGC TTT GCA 2 3 04 

He Leu Asp His Arg Gly Tyr Ala Lys Leu Val Asp Phe Gly Phe Ala 
10 755 760 765 

AAG AAA ATA GGA TTT GGA AAG AAA ACA TGG ACT TTT TGT GGG ACT CCA 2 3 52 

Lys Lys lie Gly Phe Gly Lys Lys Thr Trn Thr Phe Cyc Gly Thr Pru 

770 775 780 

15 

GAG TAT GTA GCC CCA GAG ATC ATC CTG AAC AAA GGC CAT GAC ATT TCA 2 4 00 

Glu Tyr Val Ala Pro Glu He He Leu Asn Lys Gly His Asp He Ser 
785 790 795 800 

20 GCC GAC TAC TGG TCA CTG GGA ATC CTA ATG TAT GAA CTC CTG ACT GGC 2 44 8 

Ala Asp Tyr Trp Ser Leu Gly He Leu Met Tyr Glu Leu Leu Thr Gly 

805 BIO 815 

AGC CCA CCT TTC TCA GGC CCA GAT CCT ATG AAA ACC TAT AAC ATC ATA 2 4 96 

25 Ser Pro Pro Phe Ser Gly Pro Asp Pro Met Lys Thr Tyr Asn He He 

820 825 830 

TTG AGG GGG ATT GAC ATG ATA GAA TTT CCA AAG AAG ATT GCC AAA AAT 2 544 

Leu Arg Gly He Asp Met He Glu Phe Pro Lys Lys He Ala Lys Asn 
30 835 840 845 

GCT GCT AAT TTA ATT AAA AAA CTA TGC AGG GAC AAT CCA TCA GAA AGA 2 5 92 

Ala Ala Asn Leu He Lys Lys Leu Cys Arg Asp Asn Pro Ser Glu Arg 

850 855 860 

35 

TTA GGG AAT TTG AAA AAT GGA GTA AAA GAC ATT CAA AAG CAC AAA TGG 2 64 0 

Leu Gly Asn Leu Lys Asn Gly Val Lys Asp He Gin Lys His Lys Trp 
865 870 875 880 

40 TTT GAG GGC TTT AAC TGG GAA GGC TTA AGA AAA GGT ACC TTG ACA CCT 26 8 8 

Phe Glu Gly Phe Asn Trp Glu Gly Leu Arg Lys Gly Thr Leu Thr Pro 

885 890 895 



45 



CCT ATA ATA CCA AGT GTT GCA TCA CCC ACA GAC ACA ACT AAT TTT GAC 2 736 

Pro He He Pro Ser Val Ala Ser Pro Thr Asp Thr Ser Asn Phe Asp 

900 905 910 



AGT TTC CCT GAG GAP AA'^ ^A'^ ^'r. 



Asp i^e AGp Phe 
93C 
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(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 93 3 amino acids 

5 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
10 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 5: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

20 Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
25 85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

30 He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
35 165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

40 Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Gly Ser Met Gly Thr Leu Arg Asp Leu Gin Tyr 
^5 245 250 255 

Ala Leu Gin Glu Lys He Glu Glu Leu Arg Gin Arg Asp Ala Leu He 

260 265 270 



e I 
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305 310 
Thr Lys Arg Gin Ala He Ser Ala 

325 



315 32C 
Glu Pro Thr Ala Phe Asp He Gin 

3 3 0 3 3 5 
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340 345 350 

Lys Asp Leu He Lys Glu Ala He Leu Asp Asn Asp Phe Met Lys Asn 

355 360 365 

Leu Glu Leu Ser Gin He Gin Glu He Val Asp Cys Met Tyr Pro Val 

370 375 380 

Glu Tyr Gly Lys Asp Ser Cys He He Lys Glu Gly Asp Val Gly Ser 
385 390 395 400 

Leu Val Tyr Val Met Glu Asp Gly Lys Val Glu Val Thr Lys Glu Gly 

405 410 415 

Val Lys Leu Cys Thr Met Gly Pro Gly Lys Val Phe Gly Glu Leu Ala 

420 425 430 

He Leu Tyr Asn Cys Thr Arg Thr Ala Thr Val Lys Thr Leu Val Asn 

435 440 445 

Val Lys Leu Trp Ala He Asp Arg Gin Cys Phe Gin Thr He Met Met 
15 450 455 460 

Arg Thr Gly Leu He Lys His Thr Glu Tyr Met Glu Phe Leu Lys Ser 
465 470 475 480 

Val Pro Thr Phe Gin Ser Leu Pro Glu Glu He Leu Ser Lys Leu Ala 

485 490 495 

Asp Val Leu Glu Glu Thr His Tyr Glu Asn Gly Glu Tyr He He Arg 

500 505 510 

Gin Gly Ala Arg Gly Asp Thr Phe Phe He He Ser Lys Gly Thr Val 

515 520 525 

Asn Val Thr Arg Glu Asp Ser Pro Ser Glu Asp Pro Val Phe Leu Arg 
25 530 535 540 

Thr Leu Gly Lys Gly Asp Trp Phe Gly Glu Lys Ala Leu Gin Gly Glu 
545 550 555 560 

Asp Val Arg Thr Ala Asn Val He Ala Ala Glu Ala Val Thr Cys Leu 

565 570 575 

Val He Asp Arg Asp Ser Phe Lys His Leu He Gly Gly Leu Asp Asp 

560 585 590 

Val Ser Asn Lys Ala Tyr Glu Asp Ala Glu Ala Lys Ala Lys Tyr Glu 

595 600 605 

Ala Glu Ala Ala Phe Phe Ala Asn Leu Lys Leu Ser Asp Phe Asn He 
35 610 615 620 

He Asp Thr Leu Gly Val Gly Gly Phe Gly Arg Val Glu Leu Val Gin 
625 630 635 640 

Leu Lys Ser Glu Glu Ser Lys Thr Phe Ala Met Lys He Leu Lys Lys 

645 650 655 

Arg His He Val Asp Thr Arg Gin Gin Glu His He Arg Ser Glu Lys 

660 665 670 

Gin He Met Gin Gly Ala His Ser Asp Phe He Val Arg Leu Tyr Arg 

675 680 685 

Thr Phe Lys Asp Ser Lys Tyr Leu Tyr Met Leu Met Glu Ala Cys Leu 
45 690 695 700 

Gly Gly Glu Leu Trp Thr He Leu Arg Asp Arg Gly Ser Phe Glu Asp 
705 710 715 720 



30 



40 
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755 760 765 

Lys Lys He G:y Phe Gly Lys Lys Thr Trp Thr Phe Cys Gly Thr Pro 
770 775 780 
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785 790 795 800 

Ala Asp Tyr Trp Ser Leu Gly lie Leu Met Tyr Glu Leu Leu Thr Gly 

805 810 815 

Ser Pro Pro Phe Ser Gly Pro Asp Pro Met Lys Thr Tyr Asn He He 
5 820 825 830 

Leu Arg Gly He Asp Met He Glu Phe Pro Lys Lys He Ala Lys Asn 

835 840 845 

Ala Ala Asn Leu He Lys Lys Leu Cys Arg Asp Asn Pro Ser Glu Arg 
850 855 860 

10 Leu Gly Asn Leu Lys Asn Gly Val Lys Asp He Gin Lys His Lys Trp 
865 870 875 880 

Phe Glu Gly Phe Asn Trp Glu Gly Leu Arg Lys Gly Thr Leu Thr Pro 

885 890 895 

Pro lie He Pro Ser Val Ala Ser Pro Thr Asp Thr Ser Asn Phe Asp 
15 900 905 910 

Ser Phe Pro Glu Asp Asn Asp Glu Pro Pro Pro Asp Asp Asn Ser Gly 

915 920 925 

Trp Asp He Asp Phe 
930 



20 



35 



45 



(2) INFORMATION FOR SEQ ID NO: 13 6 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2799 base pairs 

25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
30 (ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1.,.2795 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 6 



ATG GGC ACC TTG CGG GAT TTA CAG TAC GCG CTC CAG GAG AAG ATC GAG 4 8 

Met Gly Thr Leu Arg Asp Leu Gin Tyr Ala Leu Gin Glu Lys He Glu 
40 1 5 10 15 

GAG CTG AGG CAG CGG GAT GCT CTC ATC GAC GAG CTG GAG CTG GAG TTG 96 

Glu Leu Arg Gin Arg Asp Ala Leu He Asp Glu Leu Glu Leu Glu Leu 

20 25 30 



GAT CAG AAG GAC GAA CTG ATC CAG AAG CTG CAG AAC GAG CTG GAC AAG 14 4 

Asp Gin Lys Asp Glu Leu He Gin Lys Leu Gin Asn Glu Leu Asp Lys 
3 5 4 0 ^ 



GCG AGC ACC TTG CAG GGC GAG GCG GGC ACC AAG CGG CAG GCG ATC TCG 24 C 

55 Ala Ser Thr Leu Gin Gly Glu Pro Arg Thr Lys Arg Gin Ala He Ser 
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GCC GAG CCC ACC GCC TTC GAC ATC CAG GAT CTC AGC CAT GTG ACC CTG 2 88 

Ala Glu Pro Thr Ala Phe Asp He Gin Asp Leu Ser His Val Thr Leu 

85 90 95 

5 

CCC TTC TAC CCC AAG AGC CCA CAG TCC AAG GAT CTT ATA AAG GAA GCT 336 

Pro Phe Tyr Pro Lys Ser Pro Gin Ser Lys Asp Leu He Lys Glu Ala 

100 105 110 

10 ATC CTT GAC AAT GAC TTT ATG AAG AAC TTG GAG CTG TCG CAG ATC CAG 3 84 

He Leu Asp Asn Asp Phe Met Lys Asn Leu Glu Leu Ser Gin He Gin 

115 120 125 



15 



35 



45 



55 



GAG ATT GTG GAT TGT ATG TAC CCG GTG GAG TAT GGC AAG GAC AGT TGC 43 2 

Glu He Val Asp Cys Met Tyr Pro Val Glu Tyr Gly Lys Asp Ser Cys 
130 135 140 



ATC ATC AAA GAA GGA GAC GTG GGG TCA CTG GTG TAT GTC ATG GAA GAT 480 
He He Lys Glu Gly Asp Val Gly Ser Leu Val Tyr Val Met Glu Asp 
20 145 150 155 160 

GGT AAG GTT GAA GTT ACA AAA GAA GGT GTG AAG TTG TGT ACC ATG GGT 52 8 

Gly Lys Val Glu Val Thr Lys Glu Gly Val Lys Leu Cys Thr Met Gly 

165 170 175 

25 

CCA GGA AAA GTG TTT GGG GAA TTG GCT ATT CTT TAC AAC TGT ACC CGG 576 
Pro Gly Lys Val Phe Gly Glu Leu Ala He Leu Tyr Asn Cys Thr Arg 

180 185 190 

30 ACA GCG ACC GTC AAG ACT CTT GTA AAT GTA AAA CTC TGG GCC ATT GAT 624 
Thr Ala Thr Val Lys Thr Leu Val Asn Val Lys Leu Trp Ala He Asp 
195 200 205 



CGA CAA TGT TTT CAA ACA ATA ATG ATG AGG ACA GGA CTC ATC AAG CAT 67 2 

Arg Gin Cys Phe Gin Thr He Met Met Arg Thr Gly Leu He Lys His 
210 215 220 



ACC GAG TAT ATG GAA TTT TTA AAA AGC GTT CCA ACA TTC CAG AGC CTT 72 0 

Thr Glu Tyr Met Glu Phe Leu Lys Ser Val Pro Thr Phe Gin Ser Leu 
40 225 230 235 240 

CCT GAA GAG ATC CTC AGC AAG CTT GCT GAT GTC CTT GAA GAG ACC CAC 7 68 

Pro Glu Glu He Leu Ser Lys Leu Ala Asp Val Leu Glu Glu Thr His 

245 250 255 



TAT GAA AAT GGA GAA TAT ATT ATC AGG CAA GGT GCA AGA GGG GAC ACC 816 
Tyr Glu Asn Gly Glu Tyr He He Arg Gin Gly Ala Arg Gly Asp Thr 



r 



^. :j 3 



CCG AGT GAA GAC CCA GTC TTT CTT AGA ACT TTA GGA AAA GGA GAC TGG 91, 
Pro Ser Glu Asp Pro Val Phe Leu Ara Thr Leu Gly Lys Gly Asp Trp 
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TTT GGA GAG AAA GCC TTG CAG GGG GAA GAT GTG AGA ACA GCA AAC GTA 96 0 

Phe Gly Glu Lys Ala Leu Gin Gly Glu Asp Val Arg Thr Ala Asn Val 
305 310 315 320 

5 

ATT GCT GCA GAA GCT GTA ACC TGC CTT GTG ATT GAG AGA GAC TCT TTT 10 0 8 
He Ala Ala Glu Ala Val Thr Cys Leu Val He Asp Arg Asp Ser Phe 

325 330 335 

10 AAA CAT TTG ATT GGA GGG CTG GAT GAT GTT TCT AAT AAA GCA TAT GAA 105 6 

Lys His Leu He Gly Gly Leu Asp Asp Val Ser Asn Lys Ala Tyr Glu 

340 345 350 



15 



35 



45 



GAT GCA GAA GCT AAA GCA AAA TAT GAA GCT GAA GCG GCT TTC TTC GCC 1104 
Asp Ala Glu Ala Lys Ala Lys Tyr Glu Ala Glu Ala Ala Phe Phe Ala 
355 360 365 



AAC CTG AAG CTG TCT GAT TTC AAC ATC ATT GAT ACC CTT GGA GTT GGA 1152 

Asn Leu Lys Leu Ser Asp Phe Asn He He Asp Thr Leu Gly Val Gly 

20 370 375 380 

GGT TTC GGA CGA GTA GAA CTG GTC CAG TTG AAA AGT GAA GAA TCC AAA 12 0 0 

Gly Phe Gly Arg Val Glu Leu Val Gin Leu Lys Ser Glu Glu Ser Lys 
385 390 395 400 

25 

ACG TTT GCA ATG AAG ATT CTC AAG AAA CGT CAC ATT GTG GAC ACA AGA 124 8 

Thr Phe Ala Met Lys He Leu Lys Lys Arg Kis He Val Asp Thr Arg 

405 410 415 

30 CAG CAG GAG CAC ATC CGC TCA GAG AAG CAG ATC ATG CAG GGG GCT CAT 12 96 

Gin Gin Glu His He Arg Ser Glu Lys Gin He Met Gin Gly Ala His 

420 425 430 



TCC GAT TTC ATA GTG AGA CTG TAC AGA ACA TTT AAG GAC AGC AAA TAT 1344 
Ser Asp Phe He Val Arg Leu Tyr Arg Thr Phe Lys Asp Ser Lys Tyr 
435 440 445 



TTG TAT ATG TTG ATG GAA GCT TGT CTA GGT GGA GAG CTC TGG ACC ATT 13 9 2 

Leu Tyr Met Leu Met Glu Ala Cys Leu Gly Gly Glu Leu Trp Thr He 

40 450 455 460 

CTC AGG GAT AGA GGT TCG TTT GAA GAT TCT ACA ACC AGA TTT TAC ACA 144 0 

Leu Arg Asp Arg Gly Ser Phe Glu Asp Ser Thr Thr Arg Phe Tyr Thr 
465 470 475 480 



GCA TGT GTG GTA GAA GCT TTT GCC TAT CTG CAT TCC AAA GGA ATC ATT 14 8 8 

Ala Cys Val Val Glu Ala Phe Ala Tyr Leu His Ser Lys Gly He He 

4 P !^ ' - - 



55 



GCC AAA CTG GTT GAT TTT GGC TTT GCA AAG AAA ATA GGA TTT GGA AAG 15 84 

Ala Lys Leu Val Asp Phe Gly Phe Ala Lys Lys He Gly Phe Gly Lys 
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10 



15 



35 



45 



AAA ACA TGG ACT TTT TGT GGG ACT CCA GAG TAT GTA GCC CCA GAG ATC 163 2 

Lys Thr Trp Thr Phe Cys Gly Thr Pro Glu Tyr Val Ala Pro Glu He 

530 535 540 

ATC CTG AAC AAA GGC CAT GAC ATT TCA GCC GAC TAG TGG TCA CTG GGA 16 BO 

He Leu Asn Lys Gly His Asp He Ser Ala Asp Tyr Trp Ser Leu Gly 

550 555 560 

ATC CTA ATG TAT GAA CTC CTG ACT GGC AGC CCA CCT TTC TCA GGC CCA 172 8 

He Leu Met Tyr Glu Leu Leu Thr Gly Ser Pro Pro Phe Ser Gly Pro 

565 570 575 

GAT CCT ATG AAA ACC TAT AAC ATC ATA TTG AGG GGG ATT GAC ATG ATA 1776 

Asp Pro Met Lys Thr Tyr Asn lie lie Leu Arg Gly He Asp Met He 

580 585 590 



GAA TTT CCA AAG AAG ATT GCC AAA AAT GCT GCT AAT TTA ATT AAA AAA 182 4 

Glu Phe Pro Lys Lys He Ala Lys Asn Ala Ala Asn Leu He Lys Lys 
20 595 600 605 

CTA TGC AGG GAC AAT CCA TCA GAA AGA TTA GGG AAT TTG AAA AAT GGA 1872 

Leu Cys Arg Asp Asn Pro Ser Glu Arg Leu Gly Asn Leu Lys Asn Gly 
610 615 620 

25 

GTA AAA GAC ATT CAA AAG CAC AAA TGG TTT GAG GGC TTT AAC TGG GAA 192 0 

Val Lys Asp He Gin Lys His Lys Trp Phe Glu Gly Phe Asn Trp Glu 

625 630 635 640 

30 GGC TTA AGA AAA GGT ACC TTG ACA CCT CCT ATA ATA CCA AGT GTT GCA 196 8 

Gly Leu Arg Lys Gly Thr Leu Thr Pro Pro He He Pro Ser Val Ala 

645 650 655 



TCA CCC ACA GAC ACA AGT AAT TTT GAC AGT TTC CCT GAG GAC AAC GAT 2 016 
Ser Pro Thr Asp Thr Ser Asn Phe Asp Ser Phe Pro Glu Asp Asn Asp 

660 665 670 



GAA CCA CCA CCT GAT GAC AAC TCA GGA TGG GAT ATA GAC TTC TCG GAT 2 064 
Glu Pro Pro Pro Asp Asp Asn Ser Gly Trp Asp He Asp Phe Ser Asp 
40 675 680 685 

CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG 2112 
Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 
690 695 700 



GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG 216 0 
Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 



n c 



ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC 22 S 6 

55 Thr Leu Lys Phe He Cys Thr Thr Glv Lvs Leu Pro Va: Pro Tro Pre 
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ACC CTC QTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC 2 3 04 

Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr 
755 760 765 

5 

CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA 23 52 
Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 
770 775 780 

10 GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC 24 0 0 
Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr 
785 790 795 800 



15 



45 



AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC 244 8 
Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 

805 810 815 



ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG 24 96 
lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly 
20 820 825 830 

CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC 2 544 
His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala 
835 840 845 

25 

GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC 25 92 
Asp Lys Gin Lys Asn Gly He Lys Val Asn Fhe Lys He Arg His Asn 
850 855 860 

30 ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC 2 64 0 
He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 
865 870 875 880 

CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC 2 688 

35 Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 

885 890 895 

ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG 273 6 

Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
40 900 905 910 

GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC 27 84 

Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp 
915 920 925 



GAG CTG TAC AA GTAA 27 9 9 

Glu Leu Tyr Lys 

n T n 



:i) SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 932 ammo acids 
55 (B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 7: 

Met Gly Thr Leu Arg Asp Leu Gin Tyr Ala Leu Gin Glu Lys lie Glu 

15 10 15 

Glu Leu Arg Gin Arg Asp Ala Leu He Asp Glu Leu Glu Leu Glu Leu 

20 25 30 

Asp Gin Lys Asp Glu Leu He Gin Lys Leu Gin Asn Glu Leu Asp Lys 

35 40 45 

Tyr Arg Ser Val He Arg Pro Ala Thr Gin Gin Ala Gin Lys Gin Ser 
^5 50 55 60 

Ala Ser Thr Leu Gin Gly Glu Pro Arg Thr Lys Arg Gin Ala He Ser 
^5 70 75 80 

Ala Glu Pro Thr Ala Phe Asp He Gin Asp Leu Ser His Val Thr Leu 

85 90 95 

Pro Phe Tyr Pro Lys Ser Pro Gin Ser Lys Asp Leu He Lys Glu Ala 

100 105 110 

He Leu Asp Asn Asp Phe Met Lys Asn Leu Glu Leu Ser Gin He Gin 

115 120 125 

Glu He Val Asp Cys Met Tyr Pro Val Glu Tyr Gly Lys Asp Ser Cys 
25 130 135 140 

He He Lys Glu Gly Asp Val Gly Ser Leu Val Tyr Val Met Glu Asp 
145 150 155 160 

Gly Lys Val Glu Val Thr Lys Glu Gly Val Lys Leu Cys Thr Met Gly 

165 170 175 

Pro Gly Lys Val Phe Gly Glu Leu Ala He Leu Tyr Asn Cys Thr Arg 

180 185 190 

Thr Ala Thr Val Lys Thr Leu Val Asn Val Lys Leu Trp Ala He Asp 

195 200 205 

Arg Gin Cys Phe Gin Thr He Met Met Arg Thr Gly Leu He Lys His 
35 210 215 220 

Thr Glu Tyr Met Glu Phe Leu Lys Ser Val Pro Thr Phe Gin Ser Leu 
225 230 235 240 

Pro Glu Glu He Leu Ser Lys Leu Ala Asp Val Leu Glu Glu Thr His 

245 250 255 

Tyr Glu Asn Gly Glu Tyr He He Arg Gin Gly Ala Arg Gly Asp Thr 

260 265 270 

Phe Phe He He Ser Lys Gly Thr Val Asn Val Thr Arg Glu Asp Ser 

275 280 285 

Pro Ser Glu Asp Pro Val Phe Leu Arg Thr Leu Gly Lys Gly Asp Trp 

290 295 300 

Phe Gly Glu Lys Ala Leu Gin Gly Glu Asp Val Arg Thr Ala Asn Val 
305 310 315 320 

He Ala Ala Glu Ala Val Thr ^-/n ^ 



55 



J 3 

Acn Leu Lys Leu Se: 
3 70 



3 6 0 3 6 5 

Asp Phe Asn He He Asp Thr Leu Gly 
375 380 



Val Gly 



p o Tu, V- ■ 
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10 



15 



20 



25 



30 



35 



40 



45 



385 










3 90 










395 










400 


Thr 


Phe 


Ala 


Met 


Lys 


He 


Leu 


Lys 


Lys 


Arg 


His 


He 


Val 


Asp 


Thr 


Arg 










405 










^ X u 










415 


Gin 


Gin 


Glu 


His 


He 


Arq 


Ser 


Glu 


Lys 


Gin 


He 


Met 


Gin Gly Ala 


His 








420 










425 










4 3 0 






Ser 


Asp 


Phe 


He 


Val 


Arq 


Leu 


Tyr Arg 


Th r 


Phe 


Lys 


7\ 

ASp 


ber 


Liys 


Tyr 






435 










4 4 n 

^ *± u 










A A C 

44 3 




Leu 


Tyr 


Met 


Leu 


Met 


Glu 


Ala 


Cys 


Leu 


Gly 


Gly Glu 


Leu 


Trp 


Thr 


He 




450 










*± D D 










460 








Leu 


Arq 


Asp 


Arg 


Gly 


Ser 


Phe 


O JL U, 




Q o r* 

OCX 


Thr 


Tnr 


Arg 


Phe 


Tyr 


Thr 


465 










470 










475 






4 80 


Ala 


Cys 


Val 


Val 


Glu 

/toe 


Ala 


Phe 


MX a 


Tyr 


Leu 

*i J U 


Hi S 


Ser 


Lys 


Gly 


He 
495 


He 


Tyr 


Arg 


Asp 


Leu 


Lys 


Pro 


Glu 


M.sn 


Leu 


He 


Leu 


Asp 


His 


Arg 


Gly 


Tyr 








500 










CT r\ CT 

5 05 










510 




Ala 


Lys 


Leu 


Val 


Asp 


Phe 


Glv 


fne 


Ai.a 




Lys 


He 


Gly 


Phe 


Gly 


Lys 






515 










^ "5 A 
O ^ U 










^ ^ ^ 

525 




Lys 


Thr 
530 


Trp 


Thr 


Phe 


Cys 


Glv 

3 J Q 




Jr r o 


(jX U 


Tyr 


vai 
540 


TV 1 _ 

Ala 


Pro 


Glu 


He 


lie 


Leu 


Asn 


Lvs 


Gly His 




X J. \Z 


Q ^ Y" 
OCX 


>VX a 


ASp 


Tyr 


-yi* VN 

Trp 


ber 


Leu 


Gly 


545 










550 










c c c 

5 55 










560 


He 


Leu 


Met 


Tvr 


Glu 
565 


Leu 


Ti**T 1 


1 iir 


(jxy 


C7 ^ >- 

tber 

c 




Fro 


pne 


Ser 


Gly 
575 


Pro 


Asp 


Pro 


Met 


Lvs 
5 8 0 


Thr 


Tyr 




X i. C 


Tie. 
X X6 

CDC 


x^eu 


Arg 


oxy 


He 


Asp 
590 


Met 


He 


Glu 


Phe 


Pro 


Lys 


Lys 


He 


Al a 


Lys 


Asn 


i\X a 


Ala 


Asn 


Leu 


He 


Lys 


Lys 






c n c 










600 










^ r\ r" 

605 




Leu 


Cys 


Ara 

-J 


Asp 


Asn 


Pro 




Glu 


Arg 


XjCU 


ox y 




IjC LL 


Xiy s 


Asn 


Gly 




610 










U X J 










62 0 








Val 


Lys 


Agp 


He 


Gin 


Lys 


His 


Lys 


Trp 


It ilC 


Glu 


Gly 


Phe 


Asn 


Trp 


^jlU 


625 










630 










635 








64 0 


Gly 


Leu 


Arg 


Lys 


Gly Thr 


Leu 


Thr 


Pro 


Pro 


He 


He 


t^x O 


Cot" 
OCX 


Val 


AX a 










645 










*c Q n 

O 3 U 










655 




Ser 


Pro 


Thr 


Asp 


Thr 


Ser 


Asn 


Phe 


Asp 


Ser 


Phe 


Pro 


Glu 


Asp 


Asn 


Asp 








660 










665 










670 




Glu 


Pro 


Pro 


Pro 


Asp 


Asp 


Asn 


Ser 


Gly 




Asp 


He 


Asp 


Phe 


Ser 


Asp 






675 










680 










685 






Pro 


Pro 


Val 


Ala 


Thr 


Met 


Val 


Ser 


Lys 


Gly 


Glu 


Glu 


Leu 


Phe 


Thr 


Gly 




690 










695 










700 








Val 


Val 


Pro 


He 


Leu 


Val 


Glu 


Leu 


Asp 


Gly 


Asp 


Val 


Asn 


Gly 


His 


Lys 


705 










710 










715 










720 


Phe 


Ser 


Val 


Ser 


Gly Glu 


Gly Glu Gly 


Asp 


Ala 


Thr 


Tyr 


Gly 


Lys 


Leu 










725 










730 










735 




Thr 


Leu 


Lys 


Phe 
740 


He 


Cys 


Thr 


Thr 


Gly 
745 


Lys 


Leu 


Pro 


Val 


Pro 
750 


Trp 


Pro 


Thr 


Leu 


Val 


Thr 


Thr 


Leu 


Thr 


Tyr 


Gly 


Val 


Gin 


Cys 


Phe 


Ser 


Arg 


Tyr 






755 










760 










765 





55 



-.J.. Hsp ^v.v. ,ai /\sa ^rg 

805 810 815 

He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly 

820 825 B3C 
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835 840 845 

Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn 

950 855 860 

He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 
865 870 875 880 

Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 

885 890 895 

Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 

900 905 910 

Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp 

915 920 925 

Glu Leu Tyr Lys 

A 

15 (2) INFORMATION FOR SEQ ID NO: 13 8: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 2184 base pairs 
(B) TYPE: nucleic acid 

20 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
( ix ) FEATURE : 

25 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .2181 
(D) OTHER INFORMATION: 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 8: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

35 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 9 6 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

40 GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 14 4 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 



45 



TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



55 



In Fiis Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 
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Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

GTG AAG TTC GAG GGC GAG ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

5 Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

ATC GAG TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 43 2 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
10 130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 4 80 

Asn Tyr Asn Ser Kib Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 

15 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

20 GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 5 76 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 IBS 190 



25 



35 



40 



45 



CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 



AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
30 210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 72 0 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 



GGA CTC AGA TCT CGA GGC ACC ATG AGC GAC GTG GCT ATT GTG AAG GAG 7 68 

Gly Leu Arg Ser Arg Gly Thr Met Ser Asp Val Ala He Val Lys Glu 

245 250 255 

GGT TGG CTG CAC AAA CGA GGG GAG TAC ATC AAG ACC TGG CGG CCA CGC 816 

Gly Trp Leu His Lys Arg Gly Glu Tyr He Lys Thr Trp Arg Pro Arg 

260 265 270 

TAC TTC CTC CTC AAG AAT GAT GGC ACC TTC ATT GGC TAC AAG GAG CGG 8 64 

Tyr Phe Leu Leu Lys Asn Asp Gly Thr Phe He Gly Tyr Lys Glu Arg 

275 280 285 



55 



^ ^ V_7 ^ ^_ 



c 



Ala Gin Cys Gin Leu Met Lys Thr Glu Arg Pro Arg Pro Asn Thr Phe 
305 310 315 32C 
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lie lie Arg 



GTG GAG ACT 
5 Val Glu Thr 



GTG GCT GAC 
Val Ala Asp 
10 355 

TCG GGC TCA 
SejL Gly Ser 
370 

15 

CTG GCC AAG 
Leu Ala Lys 
385 

20 AAG CTG CTG 
Lys Leu Leu 

AAG GCC ACA 
25 Lys Ala Thr 



ATC GTG GCC 
He Val Ala 
30 435 

CTG CAG AAC 
Leu Gin Aan 
450 

35 

CAG ACC CAC 
Gin Thr His 
465 

40 GAG CTG TTC 
Glu Leu Phe 

GCC CGC TTC 
45 Ala Arg Phe 



Cys Leu Gin Trp Thr 
325 

CCT GAG GAG CGG GAG 
Pro Glu Glu Arg Glu 
340 

GGC CTC AAG AAG CAG 
Gly Leu Lys Lys Gin 

360 

CCC AGT GAC AAC TCA 
Pro Ser Asp Asn Ser 

375 

CCC AAG CAC CGC GTG 
Pro Lys His Arg Val 
390 

GGC AAG GGC ACT TTC 
Gly Lys Gly Thr Phe 
405 

GGC CGC TAC TAG GCC 
Gly Arg Tyr Tyr Ala 
420 

AAG GAC GAG GTG GCC 
Lys Asp Glu Val Ala 

440 

TCC AGG CAC CCC TTC 
Ser Arg His Pro Phe 

455 

GAC CGC CTC TGC TTT 
Asp Arg Leu Cys Phe 
470 

TTC CAC CTG TCC CGG 
Phe His Leu Ser Arg 
485 

TAT GGC GCT GAG ATT 
Tyr Gly Ala Glu He 
500 



285 

Thr Val Ha Glu Arg 
330 

GAG TGG ACA ACC GCC 
Glu Trp Thr Thr Ala 
345 

GAG GAG GAG GAG ATG 
Glu Glu Glu Glu Met 

365 

GGG GCT GAA GAG ATG 
Gly Ala Glu Glu Met 

380 

ACC ATG AAC GAG TTT 
Thr Met Asn Glu Phe 
395 

GGC AAG GTG ATC CTG 
Gly Lys Val He Leu 
410 

ATG AAG ATC CTC AAG 
Met Lys He Leu Lys 
425 

CAC ACA CTC ACC GAG 
His Thr Leu Thr Glu 

445 

CTC ACA GCC CTG AAG 
Leu Thr Ala Leu Lys 

460 

GTC ATG GAG TAC GCC 
Val Met Glu Tyr Ala 
475 

GAA CGT GTG TTC TCC 
Glu Arg Val Phe Ser 
490 

GTG TCA GCC CTG GAC 
Val Ser Ala Leu Asp 
505 



Thr Phe His 
335 

ATC CAG ACT 105 6 

He Gin Thr 

350 

GAC TTC CGG 1104 
Asp Phe Arg 

GAG GTG TCC 1152 
Glu Val Ser 



GAG TAC CTG 12 0 0 
Glu Tyr Leu 
400 

GTG AAG GAG 1248 
Val Lys Glu 
415 

AAG GAA GTC 12 96 

Lys Glu Val 

430 

AAC CGC GTC 1344 
Asn Arg Val 

TAC TCT TTC 13 92 
Tyr Ser Phe 



AAC GGG GGC 14 4 0 

Asn Gly Gly 
480 

GAG GAC CGG 14 8 8 

Glu Asp Arg 
495 

TAC CTG CAC 15 3 6 

Tyr Leu His 

510 



- - -..siV-^ ^--wx^-: -^r^-^ ^_:;v.:'._- /\ ^ i /-vrt.io /-v i ^„ M.w/-\ ^ * w >^ j. i. v_ 

Leu Asp Lys Asp Gly His lie Lys He Thr Asp Phe Gl'/ Leu Cys Lys 

530 535 540 

55 
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Glu Gly lie Lys Asp Gly Ala Thr Met Lys Thr Phe Cys Gly Thr Pro 

545 550 555 560 

GAG TAG CTG GCC CCC GAG GTG CTG GAG GAG AAT GAG TAG GGC CGT GCA 172 8 

5 Glu Tyr Leu Ala Pro Glu Val Leu Glu Asp Asn Asp Tyr Gly Arg Ala 

565 570 575 

GTG GAG TGG TGG GGG CTG GGC GTG GTC ATG TAG GAG ATG ATG TGC GGT 177 6 

Val Asp Trp Trp Gly Leu Gly Val Val Met Tyr Glu Met Met Cys Gly 
10 580 585 590 

CGC CTG CCC TTC TAG AAC CAG GAC CAT GAG AAG CTT TTT GAG CTC ATG 1824 

Arg Leu Pro pne Tyr Asn Gin Asp His Glu Lys Leu Phe Glu Leu lie 

595 600 605 

15 

CTC ATG GAG GAG ATG CGC TTC CCG CGC ACG CTT GGT CCC GAG GCC AAG 18 72 

Leu Met Glu Glu He Arg Phe Pro Arg Thr Leu Gly Pro Glu Ala Lys 
610 615 620 

20 TCC TTG CTT TCA GGG CTG CTC AAG AAG GAC CCG AAG CAG AGG CTT GGC 192 0 

Ser Leu Leu Ser Gly Leu Leu Lys Lys Asp Pro Lys Gin Arg Leu Gly 

625 630 635 640 



25 



35 



40 



GGG GGC TCC GAG GAC GCC AAG GAG ATG ATG CAG CAT CGC TTC TTT GCC 196 8 

Gly Gly Ser Glu Asp Ala Lys Glu He Met Gin His Arg Phe Phe Ala 

645 650 655 



GGT ATG GTG TGG CAG CAC GTG TAG GAG AAG AAG CTC AGG CCA CCC TTC 2 016 
Gly He Val Trp Gin His Val Tyr Glu Lys Lys Leu Ser Pro Pro Phe 
30 660 665 670 

AAG CCC CAG GTC ACG TCG GAG ACT GAC ACG AGG TAT TTT GAT GAG GAG 2 064 
Lys Pro Gin Val Thr Ser Glu Thr Asp Thr Arg Tyr Phe Asp Glu Glu 
675 680 685 



TTC ACG GCC CAG ATG ATC ACC ATC ACA CCA CCT GAC CAA GAT GAC AGC 2112 

Phe Thr Ala Gin Met He Thr He Thr Pro Pro Asp Gin Asp Asp Ser 
690 695 700 

ATG GAG TGT GTG GAC AGC GAG CGC AGG CCC CAC TTC CCC CAG TTC TCC 216 0 

Met Glu Cys Val Asp Ser Glu Arg Arg Pro His Phe Pro Gin Phe Ser 
705 710 715 720 



TAG TCG GCC AGC AGC ACG GCC TGA 2184 
45 Tyr Ser Ala Ser Ser Thr Ala 

725 



■ A, l^ENGTK : 'y^'^ aiTiir.c aci 

(B) TYPE: amir.o acid 

(C) STRANDEDNESS : single 
55 'D) TOPOLOGY: 1 mea- 
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10 



20 



30 



40 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 9: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

55 60 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
15 65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 3^40 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
25 145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
35 225 230 235 240 

Gly Leu Arg Ser Arg Gly Thr Met Ser Asp Val Ala He Val Lys Glu 

245 250 255 

Gly Trp Leu His Lys Arg Gly Glu Tyr He Lys Thr Trp Arg Pro Arg 

260 265 270 

Tyr Phe Leu Leu Lys Asn Asp Gly Thr Phe He Gly Tyr Lys Glu Arg 

275 280 285 

Pro Gin Asp Val Asp Gin Arg Glu Ala Pro Leu Asn Asn Phe Ser Val 

290 295 300 

Ala Gin Cys Gin Leu Met Lys Thr Glu Arg Pro Arg Pro Asn Thr Phe 
45 305 310 315 320 

He He Arg Cys Leu Gin Trp Thr Thr Val He Glu Arg Thr Phe His 

325 330 335 



55 



i ^ -^1 Msr. 

375 



3'^^ 375 380 

Leu Ala Lys Pro Lys His Arg Val Thr Met Asn Glu Phe Glu Tyr Lev 



385 



3 90 



39 



4 C 
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405 410 415 

Lys Ala Thr Gly Arg Tyr Tyr Ala Met Lys He Leu Lys Lys Glu Val 

420 425 430 

He Val Ala Lys Asp Glu Val Ala His Thr Leu Thr Glu Asn Arg Val 

435 440 445 

Leu Gin Asn Ser Arg His Pro Phe Leu Thr Ala Leu Lys Tyr Ser Phe 

450 455 460 

Gin Thr His Asp Arg Leu Cys Phe Val Met Glu Tyr Ala Asn Gly Gly 
465 470 475 480 

Glu Leu Phe Phe His Leu Ser Arg Glu Arg Val Phe Ser Glu Asp Arg 

485 490 495 

Ala Arg Phe Tyr Gly Ala Glu He Val Ser Ala Leu Asp Tyr Leu His 

500 505 510 

Ser Glu Lys Asn Val Val Tyr Arg Asp Leu Lys Leu Glu Asn Leu Met 
15 515 520 525 

Leu Asp Lys Asp Gly His He Lys He Thr Asp Phe Gly Leu Cys Lys 

530 535 540 

Glu Gly He Lys Asp Gly Ala Thr Met Lys Thr Phe Cys Gly Thr Pro 
545 550 555 560 

Glu Tyr Leu Ala Pro Glu Val Leu Glu Asp Asn Asp Tyr Gly Arg Ala 

565 570 575 

Val Asp Trp Trp Gly Leu Gly Val Val Met Tyr Glu Met Met Cys Gly 

580 585 590 

Arg Leu Pro Phe Tyr Asn Gin Asp His Glu Lys Leu Phe Glu Leu He 
25 595 600 605 

Leu Met Glu Glu He Arg Phe Pro Arg Thr Leu Gly Pro Glu Ala Lys 

610 615 620 

Ser Leu Leu Ser Gly Leu Leu Lys Lys Asp Pro Lys Gin Arg Leu Gly 
625 630 635 640 

Gly Gly Ser Glu Asp Ala Lys Glu He Met Gin His Arg Phe Phe Ala 

645 650 655 

Gly He Val Trp Gin His Val Tyr Glu Lys Lys Leu Ser Pro Pro Phe 

660 665 670 

Lys Pro Gin Val Thr Ser Glu Thr Asp Thr Arg Tyr Phe Asp Glu Glu 
35 675 680 685 

Phe Thr Ala Gin Met He Thr He Thr Pro Pro Asp Gin Asp Asp Ser 

690 695 700 

Met Glu Cys Val Asp Ser Glu Arg Arg Pro His Phe Pro Gin Phe Ser 
705 710 715 720 

40 Tyr Ser Ala Ser Ser Thr Ala 

725 

(2) INFORMATION FOR SEQ ID NO: 14 0: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2394 base pairs 

(B) TYPE: nucleic acid 



30 



FEATURE 



(A) NAME/KEY: Coding Sequence 
55 fB^ LOCATIOK: l..,2 3 9: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 0: 

ATG GAC GAA CTG TTC CCC CTC ATC TTC CCG GCA GAG CCA GCC CAG GCC 4 8 

5 Met Asp Glu Leu Phe Pro Leu lie Phe Pro Ala Glu Pro Ala Gin Ala 
is 10 15 

TCT GGC CCC TAT GTG GAG ATC ATT GAG CAG CCC AAG CAG CGG GGC ATG 96 
Ser Gly Pro Tyr Val Glu lie lie Glu Gin Pro Lys Gin Arg Gly Met 
10 20 25 30 

CGC TTC CGC TAC AAG TGC GAG GGG CGC TCC GCG GGC AGC ATC CCA GGC 144 
..^^ * xyj. jjyo \j±u Ljxy /^rg ^^er Aia uiy aer lie Pro Gly 

35 40 45 

15 

GAG AGG AGC ACA GAT ACC ACC AAG ACC CAC CCC ACC ATC AAG ATC AAT 192 

Glu Arg Ser Thr Asp Thr Thr Lys Thr His Pro Thr lie Lys He Asn 
50 55 60 

20 GGC TAC ACA GGA CCA GGG ACA GTG CGC ATC TCC CTG GTC ACC AAG GAC 24 0 

Gly Tyr Thr Gly Pro Gly Thr Val Arg He Ser Leu Val Thr Lys Asp 
65 70 75 80 



25 



35 



40 



45 



CCT CCT CAC CGG CCT CAC CCC CAC GAG CTT GTA GGA AAG GAC TGC CGG 288 
Pro Pro His Arg Pro His Pro His Glu Leu Val Gly Lys Asp Cys Arg 

85 90 95 



GAT GGC TTC TAT GAG GCT GAG CTC TGC CCG GAC CGC TGC ATC CAC AGT 33 6 

Asp Gly Phe Tyr Glu Ala Glu Leu Cys Pro Asp Arg Cys He His Ser 
30 100 105 110 

TTC CAG AAC CTG GGA ATC CAG TGT GTG AAG AAG CGG GAC CTG GAG CAG 3 84 

Phe Gin Asn Leu Gly He Gin Cys Val Lys Lys Arg Asp Leu Glu Gin 
115 120 125 



GCT ATC AGT CAG CGC ATC CAG ACC AAC AAC AAC CCC TTC CAA GTT CCT 4 32 

Ala He Ser Gin Arg He Gin Thr Asn Asn Asn Pro Phe Gin Val Pro 
130 135 140 

ATA GAA GAG CAG CGT GGG GAC TAC GAC CTG AAT GCT GTG CGG CTC TGC 4 80 

He Glu Glu Gin Arg Gly Asp Tyr Asp Leu Asn Ala Val Arg Leu Cys 

145 150 155 160 

TTC CAG GTG ACA GTG CGG GAC CCA TCA GGC AGG CCC CTC CGC CTG CCG 52 8 

Phe Gin Val Thr Val Arg Asp Pro Ser Gly Arg Pro Leu Arg Leu Pro 

165 170 175 



55 



^/i J~\^\:^ ^ J. J\\.jy^ 

eu Lys He Cys Arg Val Asn Arq Asn Ser Gly Ser Cys Leu Gly 
195 200 205 
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Gly Asp Glu lie Phe Leu Leu Cys Asp Ly3 Val Gin Lys Glu Asp lie 
210 215 220 

GAG GTG TAT TTC ACG GGA CCA GGC TGG GAG GCC CGA GGC TCC TTT TCG 72 0 

5 Glu Val Tyr Phe Thr Gly Pro Gly Trp Glu Ala Arg Gly Ser Phe Ser 
225 230 235 240 

CAA GOT GAT GTG CAC CGA CAA GTG GCC ATT GTG TTC CGG ACC CCT CCC 76 8 

Gin Ala Asp Val His Arg Gin Val Ala lie Val Phe Arg Thr Pro Pro 
10 245 250 255 



15 



TAG GCA GAC CCC AGC CTG CAG GCT CCT GTG CGT GTC TCC ATG CAG CTG 816 
Tyr Ala Asp Fro 3cr Leu Gin Ala Fio Vdl Arg Val Ser Met Gin Leu 

260 265 270 



CGG CGG CCT TCC GAC CGG GAG CTC AGT GAG CCC ATG GAA TTC CAG TAC 8 64 

Arg Arg Pro Ser Asp Arg Glu Leu Ser Glu Pro Met Glu Phe Gin Tyr 
275 280 285 

20 CTG CCA GAT AC A GAC GAT CGT CAC CGG ATT GAG GAG AAA CGT AAA AGG 912 
Leu Pro Asp Thr Asp Asp Arg Hie Arg lie Glu Glu Lys Arg Lys Arg 
290 295 300 

ACA TAT GAG ACC TTC AAG AGC ATC ATG AAG AAG AGT CCT TTC AGC GGA 960 
25 Thr Tyr Glu Thr Phe Lys Ser lie Met Lys Lys Ser Pro Phe Ser Gly 
305 310 315 320 

CCC ACC GAC CCC CGG CCT CCA CCT CGA CGC ATT GCT GTG CCT TCC CGC 100 8 
Pro Thr Asp Pro Arg Pro Pro Pro Arg Arg lie Ala Val Pro Ser Arg 
30 325 330 335 

AGC TCA GCT TCT GTC CCC AAG CCA GCA CCC CAG CCC TAT CCC TTT ACG 105 6 
Ser Ser Ala Ser Val Pro Lys Pro Ala Pro Gin Pro Tyr Pro Phe Thr 

340 345 350 

35 

TCA TCC CTG AGC ACC ATC AAC TAT GAT GAG TTT CCC ACC ATG GTG TTT 1104 
Ser Ser Leu Ser Thr He Asn Tyr Asp Glu Phe Pro Thr Met Val Phe 
355 360 365 

40 CCT TCT GGG CAG ATC AGC CAG GCC TCG GCC TTG GCC CCG GCC CCT CCC 115 2 

Pro Ser Gly Gin He Ser Gin Ala Ser Ala Leu Ala Pro Ala Pro Pro 
370 375 380 

CAA GTC CTG CCC CAG GCT CCA GCC CCT GCC CCT GCT CCA GCC ATG GTA 12 0 0 
45 Gin Val Leu Pro Gin Ala Pro Ala Pro Ala Pro Ala Pro Ala Met Val 
385 390 395 400 
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Pro Pro Gin Ala Val 

420 



Ala Fro Pro Ala Pro Lys Pre Thr Gin Ala Gly 

425 43C 
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Glu Gly Thr Leu Ser Glu Ala Leu Leu Gin Leu Gin Phe Asp Asp Glu 
435 440 445 

GAC CTG GGG GCC TTG CTT GGC AAC AGC ACA GAG CCA GCT GTG TTC ACA 13 92 

5 Asp Leu Gly Ala Leu Leu Gly Asn Ser Thr Asp Pro Ala Val Phe Thr 
450 455 460 

GAC CTG GCA TCC GTC GAC AAC TCC GAG TTT CAG CAG CTG CTG AAC CAG 144 0 

Asp Leu Ala Ser Val Asp Asn Ser Glu Phe Gin Gin Leu Leu Asn Gin 
10 465 470 475 480 

GGC ATA OCT GTG GCC CCC CAC ACA ACT GAG CCC ATG CTG ATG GAG TAC 14 8 8 

J . . — ^ Aixj- \jj.u cj.\j I'lCU i-icu. i'Jtru ^jJ-U. ^ 

485 490 495 

15 

CCT GAG GCT ATA ACT CGC CTA GTG ACA GGG GCC CAG AGG CCC CCC GAC 15 36 

Pro Glu Ala lie Thr Arg Leu Val Thr Gly Ala Gin Arg Pro Pro Asp 

500 505 510 

20 CCA GCT CCT GCT CCA CTG GGG GCC CCG GGG CTC CCC AAT GGC CTC CTT 15 84 

Pro Ala Pro Ala Pro Leu Gly Ala Pro Gly Leu Pro Asn Gly Leu Leu 
515 520 525 



25 



35 



40 



45 



TCA GGA GAT GAA GAC TTC TCC TCC ATT GCG GAC ATG GAC TTC TCA GCC 1632 
Ser Gly Asp Glu Asp Phe Ser Ser lie Ala Asp Met Asp Phe Ser Ala 
530 535 540 



CTG CTG AGT CAG ATC AGC TCC TTG GAT CCA CCG GTC GCC ACC ATG GTG 1660 
Leu Leu Ser Gin lie Ser Ser Leu Asp Pro Pro Val Ala Thr Met Val 
30 545 550 555 560 

AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC GAG 172 8 

Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu 

565 570 575 



CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG GGC 17 7 6 

Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly 

580 585 590 

GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC ACC 18 24 

Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr 

595 600 605 

ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG ACC 18 72 

Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr 

610 615 620 
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Asp Phe Fhe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Th: 

645 650 655 
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lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys 

660 665 670 

TTC GAG GGC GAG ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC GAG 2 0 64 
5 Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp 

675 680 6B5 

TTC AAG GAG GAG GGC AAC ATC CTG GGG CAC AAG CTG GAG TAG AAC TAG 2112 
Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn Tyr 
10 690 695 700 

AAC AGC CAC AAC GTC TAT ATC ATG GCC GAG AAG GAG AAG AAC GGC ATC 216 0 

Asn Ser Kia Aan Val Tyr He Men Ala Asp Lys Gin Lys Asn Gly He 

705 710 715 720 

15 

AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAG GGC AGC GTG CAG 2 2 08 

Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin 

725 730 735 

20 CTG GCC GAG CAC TAG CAG CAG AAC ACC CGC ATC GGC GAC GGC CGC GTG 2256 
Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val 

740 745 750 



25 



35 



CTG CTG GCC GAC AAC CAC TAG CTG AGC ACC CAG TCC GCC CTG AGC AAA 2 3 04 
Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys 
755 760 765 



GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG ACC 2 3 52 

Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr 
30 770 775 780 

GCC GCC GGG ATC ACT CTG GGC ATG GAC GAG CTG TAG AAG TAA 2 3 94 

Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 

785 790 795 



(2) INFORMATION FOR SEQ ID NO: 141 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 7 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



20 25 30 

Arg Phe Arg Tyr Lys Cys Glu Gly Arg Ser Ala Gly Ser He Pro Gly 
55 3 5 4 0 4 5 
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Gly Tyr Thr Gly 
65 

Pro Pro His Arg 

Asp Gly Phe Tyr 

100 

Phe Gin Asn Leu 
115 

Ala lie Ser Gin 
130 

lie Glu Glu Gin 
i4S 

Phe Gin Val Thr 

Pro Val Leu Pro 

180 

Glu Leu Lys He 
195 

Gly Asp Glu He 
210 

Glu Val Tyr Phe 
225 

Gin Ala Asp Val 

Tyr Ala Asp Pro 

260 

Arg Arg Pro Ser 
275 

Leu Pro Asp Thr 
290 

Thr Tyr Glu Thr 
305 

Pro Thr Asp Pro 

Ser Ser Ala Ser 

340 

Ser Ser Leu Ser 
355 

Pro Ser Gly Gin 
370 

Gin Val Leu Pro 
385 

Ser Ala Leu Ala 

Pro Pro Gin Ala 

420 



/^sp Leu A^a Ser 
465 

Gly He Pro Val 



55 

Pro Gly Thr Val 
70 

Pro His Pro His 
85 

Glu Ala Glu Leu 

Gly He Gin Cys 

120 

Arg He Gin Thr 
135 

Arg Gly Asp Tyr 
150 

Val Arg Asp Pro 
165 

His Pro He Phe 

Cys Arg Val Asn 

200 

Phe Leu Leu Cys 
215 

Thr Gly Pro Gly 
230 

His Arg Gin Val 
245 

Ser Leu Gin Ala 

Asp Arg Glu Leu 

280 

Asp Asp Arg His 
295 

Phe Lys Ser He 
310 

Arg Pro Pro Pro 
325 

Val Pro Lys Pro 

Thr He Asn Tyr 

360 

He Ser Gin Ala 
375 

Gin Ala Pro Ala 
390 

Gin Ala Pro Ala 
405 

Val Ala Pro Pro 



■.' Ti ^ A s p A G r. S e 1 
470 

Ala Pro His Thr 
485 



293 

60 

Arg He Ser Leu 
75 

Glu Leu Val Gly 
90 

Cys Pro Asp Arg 
105 

Val Lys Lys Arg 

Asn Asn Asn Pro 

140 

Asp Leu Asn Ala 
155 

Ser Gly Arg Pro 
170 

Asp Asn Arg Ala 
185 

Arg Asn Ser Gly 

Asp Lys Val Gin 

220 

Trp Glu Ala Arg 
235 

Ala He Val Phe 
250 

Pro Val Arg Val 
265 

Ser Glu Pro Met 

Arg He Glu Glu 

300 

Met Lys Lys Ser 
315 

Arg Arg He Ala 
330 

Ala Pro Gin Pro 
345 

Asp Glu Phe Pro 

Ser Ala Leu Ala 

380 

Pro Ala Pro Ala 
395 

Pro Val Pro Val 
410 

Ala Pro Lys Pro 
425 



475 

Thr Glu Pro Met 

4 9C 



Val Thr Lys Asp 

80 

Lys Asp Cys Arg 
95 

Cys He His Ser 
110 

Asp Leu Glu Gin 
125 

Phe Gin Val Pro 

Val Arg Leu Cys 

160 

Leu Arg Leu Pro 
175 

Pro Asn Thr Ala 
190 

Ser Cys Leu Gly 
205 

Lys Glu Asp He 

Gly Ser Phe Ser 

240 

Arg Thr Pro Pro 
255 

Ser Met Gin Leu 
270 

Glu Phe Gin Tyr 
285 

Lys Arg Lys Arg 

Pro Phe Ser Gly 

320 

Val Pro Ser Arg 
335 

Tyr Pro Phe Thr 
350 

Thr Met Val Phe 
365 

Pro Ala Pro Pro 

Pro Ala Met Val 

400 

Leu Ala Pro Gly 
415 

Thr Gin Ala Gly 
430 



j_i c ^ 1^ s u / i G n o ^ I » 

48C 

Leu Met Glu Tyr 

4 9S 
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294 

500 505 510 

Pro Ala Pro Ala Pro Leu Gly Ala Pro Gly Leu Pro Asn Gly Leu Leu 

515 520 525 

Ser Gly Asp Glu Asp Phe Ser Ser He Ala Asp Met Asp Phe Ser Ala 

530 535 540 

Leu Leu Ser Gin He Ser Ser Leu Asp Pro Pro Val Ala Thr Met Val 
545 550 555 560 

Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu 

565 570 575 

Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly 

5B0 585 590 

Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr 

COC ^ r\ r\ ^ w- 

OWU OUD 

Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr 
15 610 615 620 

Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His 
625 630 635 640 

Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr 

645 650 655 

20 He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys 

660 665 670 

Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp 

675 680 685 

Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Tyr 
25 690 695 700 

Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He 

'^rtc *-t<iA ^ 

/j-u /li) 
Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin 

725 730 735 

30 Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val 

740 745 750 

Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys 

755 760 765 

Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr 

35 770 775 780 

Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
765 790 795 



40 



(2) INFORMATION FOR SEQ ID NO: 142 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142 
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Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

GTC GAG CTG GAG GGC GAG GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
5 Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAG GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Lea Thr Leu Lys Phe He 
10 35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 

rS/c TiVi >- TiVt -K- nlir Ttr^ T,^,-. n*-™ Tr^T m t ^\ 

w -i.**^ ii*.^ w-i-jr j^j ^ jji^t^ cj.\^ V dj. cs.^^ ^ ^ ±' i- U XilL UCU V CL X liii. IIIL 

50 55 60 

CTG ACC TAG GGC GTG CAG TGC TTC AGC CGC TAG CCC GAC CAC ATG AAG 24 0 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

20 CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAG GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 



15 



25 



45 



CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAG AAG ACC CGC GCC GAG 33 6 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 



GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

30 115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAG 4 32 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

35 

AAC TAG AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

150 155 160 

40 GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 



GTG CAG CTC GCC GAC CAC TAG CAG CAG AAC ACC CCC ATC GGC GAC GGC 57 6 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 
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Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 
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Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCC ATG GAG GAA CTG TTC CCC CTC ATC TTC CCG 76 8 

5 Gly Leu Arg Ser Arg Ala Met Asp Glu Leu Phe Pro Leu He Phe Pro 

245 250 255 

GCA GAG CCA GCC CAG GCC TCT GGC CCC TAT GTG GAG ATC ATT GAG CAG 816 
Ala Glu Pro Ala Gin Ala Ser Gly Pro Tyr Val Glu He He Glu Gin 
10 260 265 270 

CCC AAG CAG CGG GGC ATG CGC TTC CGC TAC AAG TGC GAG GGG CGC TCC 8 64 

Pro Lys Gin Arg (Jly Met Arg Phe Arg Tyr Lys Cys Glu Gly Arg Ser 
275 280 285 

15 

GCG GGC AGC ATC CCA GGC GAG AGG AGC ACA GAT ACC ACC AAG ACC CAC 912 
Ala Gly Ser He Pro Gly Glu Arg Ser Thr Asp Thr Thr Lys Thr His 
290 295 300 

20 CCC ACC ATC AAG ATC AAT GGC TAC ACA GGA CCA GGG ACA GTG CGC ATC 96 0 

Pro Thr He Lys He Asn Gly Tyr Thr Gly Pro Gly Thr Val Arg He 
305 310 315 320 

TCC CTG GTC ACC AAG GAC CCT CCT CAC CGG CCT CAC CCC CAC GAG CTT 1008 
25 Ser Leu Val Thr Lys Asp Pro Pro His Arg Pro His Pro His Glu Leu 

325 330 335 

GTA GGA AAG GAC TGC CGG GAT GGC TTC TAT GAG GCT GAG CTC TGC CCG 1056 
Val Gly Lys Asp Cys Arg Asp Gly Phe Tyr Glu Ala Glu Leu Cys Pro 
30 340 345 350 

GAC CGC TGC ATC CAC AGT TTC CAG AAC CTG GGA ATC CAG TGT GTG AAG 1104 

Asp Arg Cys He His Ser Phe Gin Aan Leu Gly He Gin Cys Val Lys 

355 360 365 

35 

AAG CGG GAC CTG GAG CAG GCT ATC AGT CAG CGC ATC CAG ACC AAC AAC 1152 

Lys Arg Asp Leu Glu Gin Ala He Ser Gin Arg He Gin Thr Asn Asn 

370 375 380 

40 AAC CCC TTC CAA GTT CCT ATA GAA GAG CAG CGT GGG GAC TAC GAC CTG 12 0 0 

Asn Pro Phe Gin Val Pro He Glu Glu Gin Arg Gly Asp Tyr Asp Leu 
385 390 395 400 



45 



AAT GCT GTG CGG CTC TGC TTC CAG GTG ACA GTG CGG GAC CCA TCA GGC 124 8 
Asn Ala Val Arg Leu Cys Phe Gin Val Thr Val Arg Asp Pro Ser Gly 

405 410 415 



55 



— ^.j ^ w >„^L, t\K.,^ v., oMu ^,1-^ M^^j ^ ^^ i o 'v_vjA i ; /Vj-^C /v^*. ... . J *-t -t 

Arg Ala Pro Asn Thr Ala Glu Leu Lys He Cys Arg Val Asn Arg Asn 
435 440 445 
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Ser Gly Ser Cys Leu Gly Gly Asp Glu He Phe Leu Leu Cys Asp Lys 
450 455 460 

GTG CAG AAA GAG GAC ATT GAG GTG TAT TTC ACG GGA CCA GGC TGG GAG 144 0 
5 Val Gin Lys Glu Asp He Glu Val Tyr Phe Thr Gly Pro Gly Trp Glu 

470 475 480 

GCC CGA GGC TCC TTT TCG CAA GCT GAT GTG CAC CGA CAA GTG GCC ATT 14 8 8 

Ala Arg Gly Ser Phe Ser Gin Ala Asp Val His Arg Gin Val Ala He 
10 485 490 495 

GTG TTC CGG ACC CCT CCC TAC GCA GAC CCC AGC CTG CAG GCT CCT GTG 15 3 6 



15 



20 



25 



35 



40 



45 



55 



j*^" 1^.. . ^ . 



3 



T"!^*- rtvn "PluA m. . nl-n ■.^ _ _ 

xy± tro t>er L.eu um Ala Pro Val 

500 505 51Q 



CGT GTC TCC ATG CAG CTG CGG CGG CCT TCC GAC CGG GAG CTC AGT GAG 15 84 
Arg Val Ser Met Gin Leu Arg Arg Pro Ser Asp Arg Glu Leu Ser Glu 
515 520 525 

CCC ATG GAA TTC CAG TAC CTG CCA GAT ACA GAC GAT CGT CAC CGG ATT 163 2 

Pro Met Glu Phe Gin Tyr Leu Pro Asp Thr Asp Asp Arg His Arg He 
530 535 540 

GAG GAG AAA CGT AAA AGG ACA TAT GAG ACC TTC AAG AGC ATC ATG AAG 168 0 

Glu Glu Lys Arg Lys Arg Thr Tyr Glu Thr Phe Lys Ser He Met Lys 

550 555 560 



AAG AGT CCT TTC AGC GGA CCC ACC GAC CCC CGG CCT CCA CCT CGA CGC 172 8 
Lys Ser Pro Phe Ser Gly Pro Thr Asp Pro Arg Pro Pro Pro Arg Arg 
30 565 570 575 

ATT GCT GTG CCT TCC CGC AGC TCA GCT TCT GTC CCC AAG CCA GCA CCC 177 6 

He Ala Val Pro Ser Arg Ser Ser Ala Ser Val Pro Lys Pro Ala Pro 

580 585 590 



CAG CCC TAT CCC TTT ACG TCA TCC CTG AGC ACC ATC AAC TAT GAT GAG 1824 

Gin Pro Tyr Pro Phe Thr Ser Ser Leu Ser Thr He Asn Tyr Asp Glu 

595 600 605 

TTT CCC ACC ATG GTG TTT CCT TCT GGG CAG ATC AGC CAG GCC TCG GCC 18 72 

Phe Pro Thr Met Val Phe Pro Ser Gly Gin He Ser Gin Ala Ser Ala 

615 620 

TTG GCC CCG GCC CCT CCC CAA GTC CTG CCC CAG GCT CCA GCC CCT GCC 192 0 

Leu Ala Pro Ala Pro Pro Gin Val Leu Pro Gin Ala Pro Ala Pro Ala 

630 635 640 



Pro Val Leu Ala Pro Gly Pro Pro Gin Ala Val Ala Pro Pro Ala Pro 

665 670 
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Lys Pro Thr Gin Ala Gly Glu Gly Thr Leu Ser Glu Ala Leu Leu Gin 
675 680 685 

CTG CAG TTT GAT GAT GAA GAG CTG GGG GCC TTG CTT GGC AAC AGO ACA 2112 
Leu Gin Phe Asp Asp Glu Asp Leu Gly Ala Leu Leu Gly Asn Ser Thr 
690 695 700 

GAC CCA GCT GTG TTC ACA GAC CTG GCA TCC GTC GAC AAC TCC GAG TTT 2160 
Asp Pro Ala Val Phe Thr Asp Leu Ala Ser Val Asp Asn Ser Glu Phe 
705 710 715 720 

CAG CAG CTG CTG AAC CAG GGC ATA CCT GTG GCC CCC CAC ACA ACT GAG 22 0 8 
Gin Gin Leu Leu Asn Gin Gly He Pro Val Ala Pro His Thr Thr Glu 

725 730 735 

CCC ATG CTG ATG GAG TAC CCT GAG GCT ATA ACT CGC CTA GTG ACA GGG 22 56 
Pro Met Leu Met Glu Tyr Pro Glu Ala lie Thr Arg Leu Val Thr Gly 

740 745 750 

GCC CAG AGG CCC CCC GAC CCA GCT CCT GCT CCA CTG GGG GCC CCG GGG 2 3 04 
Ala Gin Arg Pro Pro Asp Pro Ala Pro Ala Pro Leu Gly Ala Pro Gly 
755 760 765 

CTC CCC AAT GGC CTC CTT TCA GGA GAT GAA GAC TTC TCC TCC ATT GCG 2 3 52 
Leu Pro Asn Gly Leu Leu Ser Gly Asp Glu Asp Phe Ser Ser lie Ala 
770 775 780 

GAC ATG GAC TTC TCA GCC CTG CTG AGT CAG ATC AGC TCC TAA 23 9 

Asp Met Asp Phe Ser Ala Leu Leu Ser Gin He Ser Ser 
785 790 795 



(2) INFORMATION FOR SEQ ID NO: 14 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 



^VG 'Inr 'Inr '..J^.^y Ly^ '^e-j Pre '.'a^ Pic ':rp Pre :'::r Leu '^'a ^ 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 -7^ an 
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XJ W ^ 
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Arg 
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Leu 


Pro 
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Gly 


Ser 


Cys 


Leu 


Gly 




450 
















^ 1 ^ ^ 
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Arg 


Val 


Ser 


Met 


Gin 


Leu 



90 

Asp Asp Gly Asn Tyr Lys 
105 

Val Asn Arg lie 



Thr Leu 
120 
Gly Asn 
135 

Val Tyr 

Lys lie 

Tyr Gin 

Asn His 
200 
Lys Arg 
215 

Thr Leu 

Met Asp 

Ser Gly 

Arg Phe 
280 
Glu Arg 
295 

Gly Tyr 

Pro Pro 

Asp Gly 

Phe Gin 
360 

Ala lie 
375 

He Glu 

Phe Gin 

Pro Val 

Glu Leu 
440 
Gly Asp 
455 



He Leu Gly His 

140 

He Met Ala Asp 
155 

Arg His Asn He 
170 

Gin Asn Thr Pro 
155 

Tyr Leu Ser Thr 

Asp His Met Val 

220 

Gly Met Asp Glu 
235 

Glu Leu Phe Pro 
250 

Pro Tyr Val Glu 
265 

Arg Tyr Lys Cys 

Ser Thr Asp Thr 

300 

Thr Gly Pro Gly 
315 

His Arg Pro His 
330 

Phe Tyr Glu Ala 
345 

Asn Leu Gly He 

Ser Gin Arq lie 

380 

Glu Gin Arg Gly 
395 

Val Thr Val Arg 
410 

Leu Pro His Pro 
425 

Lys He Cys Arg 

Glu He Phe Leu 

460 



95 

Thr Arg Ala Glu 
110 

Glu Leu Lys Gly 
125 

Lys Leu Glu Tyr 

Lys Gin Lys Asn 

160 

Glu Asp Gly Ser 
175 

He Gly Asp Gly 
lyu 

Gin Ser Ala Leu 
205 

Leu Leu Glu Phe 

Leu Tyr Lys Ser 

240 

Leu He Phe Pro 
255 

He He Glu Gin 
270 

Glu Gly Arg Ser 
285 

Thr Lys Thr His 

Thr Val Arg He 

320 

Pro His Glu Leu 
335 

Glu Leu Cys Pro 
350 

Gin Cys Val Lys 
365 

Gin Thr Asn Asn 

Asp Tyr Asp Leu 

400 

Asp Pro Ser Gly 
415 

He Phe Asp Asn 
430 

Val Asn Arg Asn 
445 

Leu Cys Asp Lys 



55 



515 



505 
Arg Pro 
520 



510 
Glu Leu Ser 
525 



U 
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530 535 540 

Glu Glu Lys Arg Lys Arg Thr Tyr Glu Thr Phe Lys Ser lie Met Lys 
545 550 555 ^60 

Lys Ser Pro Phe Ser Gly Pro Thr Asp Pro Arg Pro Pro Pro Arg Arg 
5 565 570 575 

He Ala Val Pro Ser Arg Ser Ser Ala Ser Val Pro Lys Pro Ala Pro 

580 585 590 

Gin Pro Tyr Pro Phe Thr Ser Ser Leu Ser Thr He Asn Tyr Asp Glu 
595 600 605 

10 Phe Pro Thr Met Val Phe Pro Ser Gly Gin He Ser Gin Ala Ser Ala 
610 615 620 

Leu Ala Pro Ala Pro Pro Gin Val Leu Pro Gin Ala Pro Ala Pro Ala 

bJU 635 640 

Pro Ala Pro Ala Met Val Ser Ala Leu Ala Gin Ala Pro Ala Pro Val 
''5 645 650 655 

Pro Val Leu Ala Pro Gly Pro Pro Gin Ala Val Ala Pro Pro Ala Pro 

660 665 670 

Lys Pro Thr Gin Ala Gly Glu Gly Thr Leu Ser Glu Ala Leu Leu Gin 

675 680 685 

Leu Gin Phe Asp Asp Glu Asp Leu Gly Ala Leu Leu Gly Asn Ser Thr 

690 695 700 

Asp Pro Ala Val Phe Thr Asp Leu Ala Ser Val Asp Asn Ser Glu Phe 
705 710 715 720 

Gin Gin Leu Leu Asn Gin Gly He Pro Val Ala Pro His Thr Thr Glu 
25 725 730 735 

Pro Met Leu Met Glu Tyr Pro Glu Ala He Thr Arg Leu Val Thr Gly 

740 745 750 

Ala Gin Arg Pro Pro Asp Pro Ala Pro Ala Pro Leu Gly Ala Pro Gly 

755 760 765 

Leu Pro Asn Gly Leu Leu Ser Gly Asp Glu Asp Phe Ser Ser He Ala 

770 775 780 

Asp Met Asp Phe Ser Ala Leu Leu Ser Gin He Ser Ser 
785 790 795 



20 



30 
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CLAIMS 

1 . A method for extracting quantitative information relating to an influence on a cellular re- 
sponse, the method comprising recording variation, caused by the influence on a mechani- 

5 cally intact living cell or mechanically intact living cells, in spatially distributed light emitted 
from a iuminophore, the luminophore being present in the cell or cells and being capable of 
being redistributed in a manner which is related with the degree of the influence, and/or of 
being modulated by a component which is capable of being redistributed in a manner which 
is related to the degree of the influence, the association resulting in a modulation of the lu- 
10 minescence characteristics of the luminophore, and processing the recorded variation in the 
spatially distributed light to provide quantitative information correlating the spatial distribution 
to the degree of the influence on the cellular response. 

2. A method according to claim 1, as used for extracting quantitative information relating to 
15 an influence on an intracellular pathway involving redistribution of at least one component 

associated with the pathway, or part thereof, the method comprising recording the result of 
the influence on mechanically intact living cell or cells, as manifested in spatially distributed 
light emitted from a luminophore which is present in the cell or cells and which is capable of 
being redistributed, by modulation of the pathway, in a manner which is related to the redis- 
20 tribution of the at least one component of the intracellular pathway, processing the recorded 
result to provide quantitative information about the spatially distributed light and correlating 
the quantitative information to the degree of the influence on the intracellular pathway. 

3. A method according to claim 1 or 2, wherein the quantitative information which is indica- 
25 tive of the degree of the cellular response to the influence or the result of the influence on 

the intracellular pathway is extracted from the recording or recordings according to a prede- 
termined calibration based on responses or results, recorded in the same manner, to known 



30 4. A method according to any of the preceding claims, wherein the influence is contact be- 
tween the mechanically intact living cell or the group of mechanically intact living cells with a 
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Chemical substance and/or incubation of the mechanically intact living cell or the group of 
mechanically intact living cells with a chemical substance. 



5. A method according to claim 4 wherein the substance is a substance whose effect on an 
5 intracellular pathway is to be determined. 



6. A mfithnd arrnrHinn tn an\/ nf fho r^rar^arHr^r^ ...u»»:.- lu i; ■ 

— ^ J ^, ^1 ww»^<^ii wiaiHio, vviicicm lmc iGuuiuiriy lb inaoe at a 

Single point in time after the application of the influence. 



0 7. A method according to any of claims 1-5, wherein the recording is made at two points in 
time, one point being before, and the other point being after the application of the influence. 

8. A method according to any of claims 1-5, wherein the recording is performed at a series of 
points in time, In which the application of the influence occurs at snmp timp ^f\e^r tho firct 

5 time point in the series of recordings, the recording being performed, e.g., with a predeter- 
mined time spacing of from 0.1 seconds to 1 hour, preferably from 1 to 60 seconds, more 
preferably from 1 to 30 seconds, in particular from 1 to 10 seconds, over a time span of from 
1 second to 12 hours, such as from 10 seconds to 12 hours, e.g., from 10 seconds to one 
hour, such as from 60 seconds to 30 minutes or 20 minutes. 

0 

9. A method according to any of claims 1-7, wherein the cell or cells is/are fixed at a point in 
time after the application of the influence at which the response has been predetermined to 
be significant, and the recording is made at an arbitrary later time. 



25 10. A method according to any of the preceding claims, wherein the luminophore is a lu- 
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11. A method according to any of the preceding claims, wherein the luminophore is a lu- 
minophore which Is capable of associating with a component which is capable of being re- 
distributed in manner which is physiologically relevant to the degree of the influence. 

5 12. A method according to any of the preceding claims, wherein the luminophore is a lu- 
minophore which is capable of being redistributed in a manner which is experimentally de- 
termined to be correlated to the degree of the influence. 

13. A method according to any of the preceding claims, wherein the luminophore is a lu- 
10 minophore which is capable of being redistributed, by modulation of the intracellular path- 
way, in substantially the same manner as the at least one component of the intracellular 
pathway. 

14. A method according to any of claims 1-13, wherein the luminophore is a luminophore 
15 which is capable of being quenched upon spatial association with a component which is re- 
distributed by modulation of the pathway, the quenching being measured as a decrease in 
the intensity of the luminescence. 

15. A method according to any of claims 1-13, wherein the variation or result with respect to 
20 the spatially distributed light emitted by the luminophore is detected by a change in the reso- 
nance energy transfer between the luminophore and another luminescent entity capable of 
delivering energy to the luminophore, each of which has been selected or engineered to be- 
come part of, bound to or associated with particular components of the intracellular pathway, 
and one of which undergoes redistribution in response to the influence, thereby changing the 

25 amount of resonance energy transfer, the change in the resonance energy transfer being 
measured as a change in the intensity of emission from the luminophore. 

the luminophore is sensed by a single channel photodetector which responds only to the av- 
30 erage intensity of the luminophore in a non-spatially resolved fashion 
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17. A method according to any of claims 1-16, wherein the property of the light being re- 
corded is intensity, fluorescence lifetime, polarization, wavelength shift, or other property 
which is modulated as a result of the underlying cellular response. 

1 8. A method according to any of claims 1 -1 5 or 1 7. wherein the recording of the spatially 
distributed light is performed using a recording system which records the spatial distribution 
of a recordable property of the light In the form of an ordered amay of values. 

19. A method according to claim 18, wherein the recording of the spatial distribution of the 
recordable property of the light is performed using a charge transfer device such as a CCD 
array or a vacuum tube device such as a vidicon tube. 

20. A method according to any of the preceding claims, wherein the light to be measured 
passes through a filter which selects the desired component of the light to be measured and 
rejects other components. 

21. A method according to any of the preceding claims, wherein the recording of the spatial 
distribution of the recordable property of light is performed by fluorescence microscopy. 

22. A method according to any of the preceding claims, wherein the recording of the varia- 
tion or result with respect to light emitted from the luminophore is performed by recording the 
spatially distributed light as one or more digital images, and the processing of the recorded 
variation to reduce it to one or more numbers representative of the degree of redistribution 
comprises a digital image processing procedure or combination of digital image processing 



23. A method according to any of claims 2-22. wherein the intracGliuiar pathway is an intra- 
cellular signalling pathway. 
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24. A method according to any of the preceding claims, wherein the luminophore is a fluoro- 
phore. 

5 25, A method according to any of the preceding claims wherein the luminophore is a 

polypeptide encoded by and expressed from a nucleotide sequence harboured in the cell or 

26. A method according to any of the preceding claims, wherein the luminophore is a hybrid 
10 polypeptide comprising a fusion of at least a portion of each of two polypeptides one of which 

comprises a luminescent polypeptide and the other one of which comprises a biologically 
active polypeptide, as defined herein. 

27. A method according to claim 26, wherein the luminescent DolvDeotide is a GFP as de- 
15 fined herein. 

28. A method according to claim 27 wherein the GFP is selected from the group consisting 
of green fluorescent proteins having the F64L mutation as defined herein. 

20 29. A method according to claim 28 wherein the GFP is a GFP variant selected from the 
group consisting of F64L-GFP, F64L-Y66H-GFP, F64L-S65T-GFP, and EGFP. 

30, A method according to any of the previous claims for detecting intracellular translocation 
of a biologically active polypeptide affecting intracellular processes upon activation, the 

75 method comnrisino 

a linker, to a biologically active polypeptide under conditions permitting expression of 
the nucleotide sequence, 
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b) modulating the activity of the biologically active polypeptide by incubating the cell or 
cells with a substance having biological activity and 

c) measuring the fluorescence produced by the incubated cell or cells and determining 
the result or variation v\^ith respect to the fluorescence, such result or variation being 

5 indicative of the translocation of a biologically active polypeptide in said cell. 

31 . A method according to claim 30, wherein the nucleotide sequence is a DNA sequence. 

32. A method according to claim 30 or 31, wherein the modulation is an activation. 

10 

33. A method according to claim 30 or 31 , wherein the modulation is a deactivation. 

34. A method according to any of claims 30-33 wherein the fluorescence of the cell or cells is 
n.caouicu piiui lu mc i noauidLiui I, dHCi tht! resuit ur vanation determinea in step (c) is a 

15 change in fluorescence compared to the fluorescence measured prior to the modulation. 

35. A method according to any of claims 30-34, wherein the intracellular processes are intra- 
cellular signalling pathways. 

20 36. A method according to claim 34, wherein the change in fluorescence measured in step 
(c) comprises determining a change in the spatial distribution of the fluorescence. 

37, A method according to any of the preceding claims wherein the mechanically intact living 
cell or cells is/are a mammalian cell/mammalian cells which, during the time peroid over 
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38. A method according to any of the preceding claims, wherein the at least one mechani- 
cally intact living cell is part of a matrix of identical or non-identical cells. 

39. A method according to any of claims 1-36 and 38, wherein the cell or cells is/are se- 

5 lected from the group consisting of fungal cells, such as a yeast cell; invertebrate cells in- 
cluding insect cells; and vertebrate cells, such as mammalian cells. 

40. A nucleic acid constmct coding for a fusion polypeptide comprising a biologically active 
polypeptide that is a component of an intracellular signalling pathway, or a part thereof, and 

10 a GFP, with the proviso that the construct is not a construct coding for a fusion polypeptide 
In which the biologically active polypeptide is selected from the group consisting of PKC- 
alpha, PKC-gamma, and PKC-epsilon. 

41. A nucleic acid construct coding for a fusion polypeptide comprising a biologically active 

1 5 polypeptide that is a component of an intracellular signalling pathway, or a part thereof, and 
an F64L mutant of GFP. 

42. A nucleic acid construct according to claim 40 or 41 , wherein the biologically active 
polypeptide is a protein kinase or a phosphatase. 

20 

43. A nucleic acid construct according to any of claims 40-42 wherein the GFP is N- or C- 
terminaily tagged, optionally via a peptide linker, to the biologically active polypeptide or part 
thereof. 

25 44. A nucleic acid construct according to any of claims 40, 41 and 43, wherein the biologi- 
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45. A nucleic acid construct according to any of claims 40. 41 and 43, wherein the biologi- 
cally active polypeptide is a protein, or a part thereof, which is associated with the cy- 
toskeietal network and which changes cellular localisation upon activation. 

46. A nucleic acid construct according to any of claims 40-43, wherein the biologically active 
polypeptide is a protein kinase or a part thereof which changes cellular localisation upon ac- 
tivation. 



47. A nucleic acid construct according to claim 46, wherein the protein kinase is a ser- 
ine/threonine protein kinase or a part thereof capable of changing intracellular localisation 
upon activation. 



48. A nucleic acid construct according to claim 46, wherein the protein kinase is a tyrosine 
protein kinase or a part thereof capable of changing intracellular localisation upon activation. 

15 

49. A nucleic acid construct according to claim 46, wherein the protein kinase is a phosphol- 
ipid-dependent serine/threonine protein kinase or a part thereof capable of changing intra- 
cellular localisation upon activation. 



50. A nucleic acid construct according to claim 46, wherein the protein kinase is a cAMP- 
dependent protein kinase or a part thereof capable of changing cellular localisation upon ac- 
tivation. 



51. A nucleic acid construct according to claim 50 which codes for a PKAc-F64L-S65T-GFP 
25 fusion. 



depenaent protein kinase or a part thereof capable of changing cellular localisation upon 
tivation. 



ac- 
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53. A nucleic acid construct according to claim 46, wherein the protein kinase is a 
calmodulin-dependent serine/threonine protein kinase or a part thereof capable of changing 
cellular localisation upon activation. 

54. A nucleic acid construct according to claim 46, wherein the protein kinase is a mitogen- 
activated serine/threonine protein kinase or a part thereof caoabie of chanoina cellular local- 
isation upon activation. 

55. A nucleic acid construct according to claim 54, which codes for an ERK1-F64L-S65T- 
GFP fusion. 

56. A nucleic acid construct according to claim 54, which codes for an EGFP-ERK1 fusion. 

57. A nucleic acid construct according to claim 46, wherein the protein kinase is a cyclin- 
dependent serine/threonine protein kinase or a part thereof capable of changing cellular lo- 
calisation upon activation. 

58. A nucleic acid construct according to claim 42 or 43, wherein the biologically active 
polypeptide is a protein phosphatase or a part thereof capable of changing cellular localisa- 
tion upon activation. 

59. A nucleic acid construct according to any of claims 40-58 which is a DNA construct. 



61 . A nucleic acid construct according to claim 60 in which the gene encoding GFP is the 

ce^p or-cnriina EGFP as de^:'T::: "^n-e:^^ 
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62. A nucleic acid construct according to claim 60 in which the gene encoding a GFP is a 
gene encoding a GFP variant selected from F64L-GFP, F64L-Y66H-GFP and F64L-S65T- 
GFP. 

63. A DNA construct according to claim 59 and 61 or, where applicable, 62, which is a con- 
struct as identified by any of the DNA sequences shown in SEQ ID NO: 38. 40, 42. 44. 46. 
48, 50, 52, 54. 56, 58, 60, 62, 64, 66, 68, 70. 72, 74, 76, 78, 108, 110, 112, 114, 116. 118, 
120, 122, 124. 126, 128, 130, 132. 134, 136, 138, 140, and 142, oris a variant thereof ca- 
pable of encoding the same fusion polypeptide or a fusion polypeptide which is biologically 
equivalent thereto, as defined herein. 

64. A cell containing a nucleic acid construct according to any of claims 40-63 and capable 
of expressing the sequence encoded by the construct. 

65. A cell according to claim 64, which is a eukaryotic cell. 

66. A cell according to claim 64, which is selected from the group consisting of fungal cells, 
such as yeast cells; invertebrate cells, including insect cells, and vertebrate cells, such as 
mammalian cells. 

67. A ceil according to claim 66, which is a mammalian cell. 

68. An organism carrying in at least one of its component cells a nucleic acid sequence as 
contained in the constructs accordinq to any of claims 40-59. said cell being capable of ex- 



69. An organism according to claim 68 which is selected from the group consisting of uni- 
cellular and multicellular organisms, such as a mamma!, 
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70. A fluorescent probe comprising a GFP which is N- or C-terminally tagged, optionally via a 
peptide linker, to a biologically active polypeptide or a part or a subunit thereof which is a 
component of a intracellular signalling pathway as defined herein, the probe being a probe 

5 which is encoded by the nucleic acid construct according to any of claims 40-59. 

71. A method according to any of claims 1-39. wherein the luminophore is a fusion doIvdcd- 
tide as encoded by the nucleic acid construct according to any of claims 40-63. 

10 72. A method according to any of claims 1-39 or 71 in which the method of the invention is 
used in a screening program as defined herein. 

73. An apparatus for measuring the distribution of fluorescence in at least one cell, and 
thereby any change in the distribution of fluorescence in at least one cell, which includes the 

15 fallowing component parts: (a) a light source, (b) a means for selecting the waveiength(s) of 
light from the source which will excite the fluorescence of the protein, (c) a means for rapidly 
blocking or pass ing the excitation light into the rest of the system, (d) a series of optical 
elements for conveying the excitation light to the specimen, collecting the emitted fluores- 
cence in a spatially resolved fashion, and forming an image from this fluorescence, (e) a 

20 bench or stand which holds the container of the cells being measured in a predetermined 
geometry with respect to the series of optical elements, (f) a detector to record the spatially 
resolved fluorescence in the form of an image, (g) a computer or electronic system and as- 
sociated software to acquire and store the recorded images, and to compute the degree of 
redistribution from the recorded images, 

25 

74, An apparatus according to claim 73 in which some or all of the system is automated, 

cence microscope. 

30 
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76. An apparatus according to claim 73 in which component f is a CCD camera. 

77. An apparatus according to claim 73 in which the image is formed and recorded by an 
optical scanning system. 

5 

78. An apparatus according to claim 73 in which a liquid addition system is used to add a 

tJiKMivyvvi I owiH^juuiiu Lu any ui du ui Liiti (jtjiib III irtfcj ceii noiuer at a time aeierminea 
in advance. 

10 79. An apparatus according to claim 78 in which the liquid addition system is under the con- 
trol of the computer or electronic system. 

80. A method according to any of claims 1-79 wherein the method is a screening program for 

the identification of a bioloaicallv active suhstan Ce as definpH hprtiin that Hirprfl>/ nr inHirontk/ 

15 affects an intracellular signalling pathway and is potentially useful as a medicament, wherein 
the result of the individual measurement of each substance being screened which indicates 
its potential biological activity is based on measurement of the redistribution of spatially re- 
solved luminescence in living cells and which undergoes a change in distribution upon acti- 
vation of an intracellular signalling pathway. 

20 

81 A method according to any of claims 1-79 wherein the method is a screening program for 
the identification of a biologically toxic substance as defined herein that exerts its toxic effect 
by interfering with an intracellular signalling pathway, wherein the result of the individual 
measurement of each substance being screened which indicates its potential biologically 
25 toxic activity is based on measurement of the redistribution of said fluorescent probe in living 
cells and which undergoes a change in distribution upon activation of an intracellular sianal- 



82. A method according to any of claims 1-80 wherein a fiuorescent probe is used in back- 
30 tracking of signal transduction pathways as defined herein, 
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83, A method of treating a condition or disease related to the intracellular function of a pro- 
tein kinase comprising administering to a patient suffering from said condition or disease an 
effective amount of a compound which has been discovered by any method according to the 

5 invention. 

84. A compound that modulates a comoonent of an intracellular oathwav as defined herein- 
as detemnined by a method according to the method of the invention, 

10 85. A medical composition comprising a therapeutic amount of a compound identified ac- 
cording the method of the invention. 

86. A method of selectively treating a patient suffering from an ailment which responds to 
medical treatment comprising obtaining a primary cell or cells from said patient, transfecting 

15 the cell or cells with at least one DNA sequence encoding a fluorescent probe according to 
the invention, culturing the cell or cells under conditions permitting the expression of said 
probes and exposing it to an array of medicaments suspected of being capable of alleviating 
said ailment, then comparing changes in fluorescence patterns or redistribution patterns of 
the fluorescent probes in the intact living cell or cells to detect the cellular response to the 

20 specific medicaments (obtaining a cellular action profile), then selecting a medicament(s) 
based on desired activity and acceptable level of side effects and administering an effective 
amount of said medicament(s) to said patient. 

87. A method according to any of claims 1-80 of identifying a drug target among the group of 
25 biologically active polypeptides which are components of intracellular signalling pathways. 
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Fig 4 
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Fig 5 
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Fig 7 
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Fig 9 
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Fig. 12 
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